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Abstract 



This work is about diagrammatic languages, how they can be represented, and what 
they in turn can be used to represent. More specifically, it focuses on representations 
and applications of string diagrams. String diagrams are used to represent a collection 
of processes, depicted as "boxes" with multiple (typed) inputs and outputs, depicted 
as "wires". If we allow plugging input and output wires together, we can intuitively 
represent complex compositions of processes, formalised as morphisms in a monoidal 
category. 

While string diagrams are very intuitive, existing methods for defining them rigorously 
rely on topological notions that do not extend naturally to automated computation. The 
first major contribution of this dissertation is the introduction of a discretised version of 
a string diagram called a string graph. String graphs form a partial adhesive category, so 
they can be manipulated using double-pushout graph rewriting. Furthermore, we show 
how string graphs modulo a rewrite system can be used to construct free symmetric 
traced and compact closed categories on a monoidal signature. 

The second contribution is in the application of graphical languages to quantum infor- 
mation theory. We use a mixture of diagrammatic and algebraic techniques to prove 
a new classification result for strongly complementary observables. Namely, maximal 
sets of strongly complementary observables of dimension D must be of size no larger 
than 2, and are in 1-to-l correspondence with the Abelian groups of order D. We also 
introduce a graphical language for multipartite entanglement and illustrate a simple 
graphical axiom that distinguishes the two maximally-entangled tripartite qubit states: 
GHZ and W Notably, we illustrate how the algebraic structures induced by these oper- 
ations correspond to the (partial) arithmetic operations of addition and multiplication 
on the complex projective line. 

The third contribution is a description of two software tools developed in part by the au- 
thor to implement much of the theoretical content described here. The first tool is Quan- 
tomatic, a desktop application for building string graphs and graphical theories, as well 
as performing automated graph rewriting visually. The second is QuantoCoSy, which 
performs fully automated, model-driven theory creation using a procedure called con- 
jecture synthesis. 



Chapter 1 

Introduction 



Quantum information theory is the study of how data can be encoded and manipulated using mi- 
croscopic systems subject to quantum effects. Over the past two decades, it has grown into a large 
and diverse field, with applications in security, where quantum effects are used to design "unlis- 
tenable" data channels, foundations of physics, where fundamental principles of information are 
used to derive physical theories, and perhaps most notably quantum computing, where classically 
intractable computations such as factorisation of huge numbers can happen in the blink of an eye. 
Virtually all of these applications use quantum theory exactly as John von Neumann described it 
in 1932. However, amidst the increasing scale the problems considered, it becomes clear that this 
is analogous to writing complex computer programs using circuit diagrams. As in the case with 
software development, abstracting away from the low-level is crucial to progress. 

In this dissertation, we seek out this abstraction by identifying and exploiting the behaviour of 
graphical representations of quantum systems. We develop a tool set for graphical reasoning by 
drawing a connection between categorical algebra and graph rewriting. We then show how these 
this tool set can be applied to the description of quantum phenomena using the language of string 
diagrams. 

String diagrams consist of boxes, which represent processes (physical, logical, algebraic, ...) that 
have some inputs and some outputs. Some of those inputs and outputs can be connected together 
using wires. 




The only real requirement we impose on string diagrams is that their "value" (typically as some 
sort of map, relation, or process) is unaffected by topological deformations. Due to the strongly 
physical and spatial qualities of string diagrams, it should come as no surprise that they were 
originally formulated by a physicist. String diagrams originated with Roger Penrose in 1971 [58 1 as 
an alternative notation for contractions of what he called abstract tensors, which are essentially just 
morphisms with some named inputs and outputs. Furthermore, the idea of representing spatially 
and temporally composed processes using these types of diagrams dates back at least to the 1948 
advent of Feynman diagrams l36l . 

String diagrams make sense for any mathematical structure that has a well-behaved notion of 
horizontal (i.e. spatial) and vertical (i.e. temporal) composition. A very general way to formalise 
such structures is to use monoidal categories, which were introduced by Mac Lane [48] to describe 
a wide variety of categories admitting associative, product-like structures (e.g. cartesian products, 
direct sums, tensor products). 

A connection between the notions of string diagrams and monoidal categories was inevitable. 
Twenty years after the introduction of string graphs, Joyal and Street [35J formalised this idea by 
using string diagrams (considered as topological graphs with extra structure) to build free monoidal 
categories. Intuitively, a "free X" is an object for which the axioms of an "X" are true, but nothing 
else. So, a free monoidal category is a monoidal category where two morphisms are equal if and only 
if they are equal by the axioms of a monoidal category. In other words, string diagrams, compared 
up to topological deformations (of a particular kind) exactly represent morphisms compared up to 
the axioms of a monoidal category. 

A monoidal category is a category equipped with a bifunctor (g) : V X V — > V that is associative, 
up to isomorphism and has a left and right unit I E obV. There are many notions of monoidal 
categories with additional structure that have an extremely wide variety of applications in areas 
such as the study of braids and knots, linear algebra and representation theory, quantum field the- 
ory, higher-dimensional algebra, enriched and internal category theory, homotopy theory, linear 
logic, and programming language semantics. We introduce a few of these extended notions of 
monoidal category in chapter [2] namely strict and non-strict (planar) monoidal categories, braided 
and symmetric monoidal categories, symmetric traced categories, left- and right-autonomous cat- 
egories, compact closed categories, t-monoidal (pronounced dagger-monoidal) categories, and t- 
compact closed categories. We offer a summary of what is known about the relationships between 
these kinds of categories, coherence results, and most importantly, graphical language theorems. A 
much more comprehensive collection of graphical language definitions, as well as the state of the 
art in what is and is not known about these languages is available in Selinger 's excellent survey 
paper l62l . 



We also review the notion of abstract tensor networks, roughly in the form it was introduced 
by Penrose in [58] and relate its formulation to monoidal categories and the topological graphical 
languages introduced by Joyal and Street. 

One of the most useful aspects of a monoidal category is that it allows one to define algebraic 
structures internal to a monoidal category. That is, an algebraic structure can be defined as a col- 
lection of morphisms in some monoidal category satisfying some axioms. Since such a definition 
only relies on the structure of a monoidal category, it makes sense in any monoidal category. For 
instance, one can define a monoid in V as a triple {A, }i,n) where A is an object and (i:A®A-jA 
and rj : I — s> A are two morphisms satisfying some equations (namely, associativity and unit laws). 
A monoid in the category of sets and functions is just the usual notion of a monoid, i.e. a unital 
semigroup. In the category of vector spaces and linear maps, it is a unital, associative algebra. In 
the opposite category, it is a counital, coassociative coalgebra. In the category of categories and 
functors, it is a (strict) monoidal category, justifying the intuition that a monoidal category is just a 
"categorified" monoid. In chapter[3l we define algebraic structures internal to a monoidal category 
and give various examples that will be used throughout this dissertation. These include monoids, 
commutative monoids, comonoids, Frobenius algebras, bialgebras, and Hopf algebras. We also 
provide many concrete examples of these algebraic structures, as they occur in familiar (and some 
less-familiar) categories. 

Building on this background, the bulk of the thesis is organised into two roughly independent 
parts. The first part is about applying techniques from the theory of graph rewriting to string 
diagrams. The second part is about applying monoidal category theory and graphical languages 
to the study of quantum mechanics. In particular, diagrams are used to study quantum computing 
and quantum entanglement theory. A third, shorter part focuses on implementing the theoretical 
work from the previous two parts in a program called Quantomatic. 

Part IT] opens with an introduction to rewrite systems in chapter El Rewrite systems provide a 
very general means of reasoning systematically about algebraic theories. In fact, this reasoning is 
so systematic that it can be done by a computer. Rewriting lives at the heart of most computer 
algebra systems (CAS), automated reasoning tools, and proof assistants. The idea behind rewriting 
is very simple. Rather than considering equations (s = t) between terms, as one typically does in 
(universal) algebra, one considers directed reductions called rewrite rules (s — > t). The application 
of rewrite systems from an algebraists point of view is that they can help solve word problems. 

A word problem is a question of the form, "Is term s equivalent to term t by the axioms of an 
equational theory E?" It is well known that word problems are not decidable in general. How- 
ever, given a suitably nice algebraic theory and some elbow grease, it often is possible to solve a 
word problem by turning E into a rewrite system R. If we end up with a nice enough rewrite sys- 



tern, we can solve the word problem by rewriting s repeatedly until no rule from R applies (called 
normalising s), doing the same to t, and comparing the two results to see if they are equal. 

If we just randomly pick directions for each of the equations in E, this technique is very unlikely 
to work. However, if we can find a nice rewrite system (i.e. one that is terminating and confluent), 
normal forms always exist and are unique, and there is an evident solution to the word problem for 
all terms. Thus a large portion of the rewriting literature is about how to go about turning sets of 
equations into nice rewrite systems, turning ill-behaved rewrite systems into better ones, and cop- 
ing with ill-behaved systems using more sophisticated strategies than "normalise and compare". 

Rewrite systems are not just restricted to terms. Just as term rewriting can be thought of us 
replacing certain subtrees (corresponding to subterms) with other trees, we can consider replac- 
ing certain subgraphs with another graph. This is called graph rewriting. In 1973, Ehrig, Pfender, 
and Schneider introduced the double pushout (DPO) approach to graph rewriting |27|. We explain 



this technique in detail and with examples in section 4.2 While DPO rewriting can be formulated 
in many categories (including the category of sets, the category of graphs, and any topos), DPO 
rewriting is not well-defined in all categories with pushouts. In 1979, Ehrig and Kreowski identi- 
fied certain abstract properties of a category with pushouts that allow one to do double-pushout 
rewriting )29|. One abstract formulation of categories in which DPO rewriting makes sense are 
adhesive categories, introduced by Lack and Sobocihski in 2003 l43l . In these categories, pushouts 
involving monomorphisms behave like coverings in the sense that they form so-called van Kam- 
pen Squares. All toposes are adhesive categories, and in a recent result [42 1, Lack showed that any 
adhesive category embeds fully and faithfully in a topos, and that embedding preserves all the ad- 
hesive structure. So, another way to think of adhesive categories is "categories where pushouts of 
monomorphisms behave as they do in toposes". 



In section 4.4 we generalise adhesive categories to partial adhesive categories. These are cate- 
gories C that embed fully and faithfully in an adhesive category S : C — > A., such that S preserves 
monomorphisms. Intuitively, these are categories whose objects are the objects of an adhesive cat- 
egory (e.g. directed graphs) that satisfy certain additional axioms (e.g. simple graphs: at most 
one edge connecting any vertex to another). We then illustrate that the adhesive-like properties 
of pushouts in A are inherited by the S-pushouts in C (i.e. the pushouts in C that exist and are 
preserved by S). As a result, DPO rewriting is well-defined for partial adhesive categories as long 
as one restricts to certain matching morphisms called S-matchings. 

Graph rewriting can be applied to string diagrams, but not directly. This is for the simple reason 
that string diagrams are not graphs in a strict sense. The wires in string diagrams need not be 
connected to boxes at both ends. They can even be connected to themselves to form circles. Wires 
that are not connected to a box at their source serve as inputs for string diagrams, and wires that are 
not connected at their target serve as outputs. Wires that are not connected to a box at either end 



are called free wires, and represent identity maps. A directed graph G consists of a set of vertices 
Vq and a set of edges Eq, as well as total functions s, t : Eq — > Vq. Therefore we cannot represent 
string diagrams as digraphs. Even if we relax the requirement that s and t be total functions, there 
is no way to distinguish circles from free wires. 

We solve this problem by defining string graphs. These are typed graphs whose vertices fall into 
two categories: wire-vertices and box-vertices. The wires in string diagrams are replaced by chains 
of wire-vertices. 




Representing boxes as box-vertices, we can translate string diagrams into string graphs. 




\-> 




In chapter [5] we define the partial adhesive category of string graphs and string graph homo- 
morphisms. We also define special pushouts called pluggings, which are used to plug the outputs of 
one string graph into the inputs of another string graph to form the composed graph. We also de- 
fine string graph rewrite rules in such a way that any monomorphism is an S-matching. Therefore 
double-pushout rewriting is always well-defined. 

The vigilant reader will notice that the correspondence between string diagrams and string 
graphs is nearly 1-to-l. The only obstacle is that wires in string diagrams can be converted to chains 
of wire-vertices of any length. To eliminate this redundancy, we consider two string graphs to be 
equivalent if the only difference between the two is the length of the wires. 
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This corresponds to the wires of the associated string diagrams being homeomorphic, when 
considered as subspaces of topological graphs. For that reason, this equivalence relation is called 
ivire-homeomorphism. We define wire-homeomorphism using a confluent, terminating rewrite sys- 



tem on string graphs in section 5.2.1 



Like their topological counterparts, string graphs can be used to construct free monoidal cat- 
egories. We do this by defining a framed cospan construction over the category of string graphs. 
Recall that for any category C with pushouts, the bicategory of cospans Csp(C) has as objects the 
objects of C, 1-morphisms cospans X — > F <- — Y, and 2-morphisms cospan homomorphisms. 
Composition is performed by pushout, and identities are cospans of identity maps. We form the 
category of framed cospans of string graphs by restricting the objects in the cospan construction to 
discrete graphs consisting of wire-vertices and the cospans X — > G < — Y to maps covering the 
inputs and outputs of G. Composition by pushout then reduces to the intuitive notion of plugging 
together string graphs. 



H 






HoG 



In section 5.5 we show that the free symmetric traced category and the free compact closed 
category on a monoidal signature can be constructed as a category of framed cospans of string 
graphs. By shifting from a topological representation to a combinatoric one, these morphisms can 
be represented straightforwardly on a computer, and they can be manipulated using automated 
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graph rewriting techniques. This is explored in Part III following the introduction of graphical 
theories for quantum computing. 

Part III] describes in detail two graphical theories that are of particular interest for quantum 
computing. These theories were formulated in the context of Categorical Quantum Mechanics 
(CQM), a program initiated by Abramsky and Coecke in 2004 |2j |3| whose purpose was to study 
quantum phenomena from the point of view of monoidal category theory. More than any one 
particular result, CQM represents a set of principles and an approach to the study of quantum 
theory. In this approach, compositionality is at the forefront. CQM asserts that all of the interesting 
and important aspects of quantum theory can be witnessed by studying systems and processes 
and the ways in which they compose. It emphasises the role of compound systems, information 
flow, and diagrammatic reasoning while de-emphasising the role of Hilbert spaces as a crucial 
component to the understanding of quantum phenomena. 

This part opens with a brief introduction for the non-physicist to quantum mechanics, quan- 
tum computing, and quantum information theory in chapter [6] Chapter f7\ introduces categorical 
quantum mechanics and illustrates the role of monoidal categories in quantum teleportation and 
the study of complementary observables. The latter was explored in detail by Coecke and Dun- 
can in 1151 . In quantum mechanics, an observable comes with a basis of orthonormal eigenstates 
corresponding to measurement outcomes. Two non-degenerate observables O and O' are called 
complementary if their associated bases of eigenstates are mutually unbiased. That is, for bases 
{|m;}}, {|z>/)} and for all i,j (where D is the dimension of the space): 

\(Ui\Vj)\ 2 = ^ 

Intuitively, if we measure a quantum state in an eigenstate of O with respect to the O' observ- 
able, we are equally likely to get any outcome. That is, maximal knowledge of a state with respect 
to O implies minimal knowledge with respect to O' . A familiar example of complementary observ- 
ables is position and momentum. 

Mutually unbiased bases can be understood algebraically using particular kinds of interacting 
Frobenius algebras. Frobenius algebras in a monoidal category consist of a monoid (A, Jf , ^ ) 
and a comonoid (A, Jk , 4 ) satisfying the Frobenius identity. 





t-Frobenius algebras are Frobenius algebras whose comonoid structure is the adjoint of the 
monoid structure, i.e. Jk = ( y ) and 4 = ( ? ) ■ 

Commutative t-Frobenius algebras have attracted attention in recent years for exhibiting pre- 
cisely the identities of 2-dimensional cobordisms, whose understanding is a crucial stepping stone 
to the formulation of topological quantum field theories. For more details, see e.g. |5. 41 J. 
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Special Frobenius algebras satisfy an additional identity on the loop map. 



As the name suggests, t-special commutative Frobenius algebras (t-SCFAs) are commutative 
t-Frobenius algebras that are special. Coecke, Pavlovic, and Vicary showed that orthonormal bases 
over finite-dimensional complex Hilbert spaces are in 1-to-l correspondence with t-SCFAs. So, 
rather than studying mutually unbiased bases themselves, we can study their associated t-SCFAs. 
From this point of view, the mutually unbiased basis condition can be summed up in a simple 
graphical identity, where ( TJ , 9 / j\ , 6 ) is the t-SCFA induced by an orthonormal basis and 
, 9/ A. ' 6 ) i s f ne t-SCFA induced by another, mutually unbiased basis. 



V , 



00= (i.i) 



In [15 1, the Coecke and Duncan introduced several stronger forms of complementarity One 
example is the case where the induced algebras of the two bases extend to a bialgebra. That is, the 
following equations are satisfied. 



(1.2) 



In this dissertation, we refer to a pair of observables whose bases satisfy this condition as 
strongly complementary observables. Coecke and Duncan showed that under certain additional as- 
sumptions, the equations in ||1.2| imply (fTTTJ in an arbitrary compact closed category. In section 7.2 




we simplify this result in the case of the category of finite-dimensional Hilbert spaces by providing 
a new proof that | |1.2| always implies (1.1). We also provide a new classification result for pairs of 
strongly complementary observables. 

Theorem. Strongly complementary pairs of observables in a Hilbert space of dimension D are in 1-to-l 
correspondence with the finite Abelian groups of order D. 

Furthermore, we show that it is impossible for three distinct observables to be pairwise strongly 
complementary. This then classifies maximal sets of strongly complementary observables for all 
dimensions. 

In chapter [8] we turn to the application of diagrammatic techniques in the study of multipartite 
entanglement. The classification, computation, and manipulation of complex, many-body entan- 
gled quantum states is one of the most difficult problems facing quantum physicists and quantum 
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information theorists. Any naive approach to the problem of classifying multipartite entanglement 
is doomed to fail, and brute-force calculations involving many entangled quantum systems are 
untenable on today's computers. This suggests the need for more sophisticated techniques that 
capture and exploit as many symmetries and fundamental structure within a quantum system as 
possible. Rather than studying a multipartite state as a single, monolithic entity, we study it in 
terms of its components and explore how those components interact. We call this the compositional 
approach to multipartite entanglement. 

By way of the Choi-Jamiolkowski isomorphism, we can consider quantum states and processes 
on the same footing. In that sense, a bipartite quantum state in H (g> H can be thought of as a quan- 
tum channel from H to H. Similarly, we can treat a tripartite state as a map from H ® H to H, i.e. a 
binary operation on quantum states in H Nearly all algebraic objects of interest (e.g. groups, rings, 
vector spaces) are sets equipped with one or more binary operations satisfying certain axioms. For 
that reason, we adopt a motto: "Just as binary operations have a special status in the study of 
algebra, so too should tripartite states in the study of multipartite entanglement." 

To justify this assertion, we develop a methodology for representing and studying arbitrary 
qubit states using tripartite states as building blocks. It is a well-known result from quantum en- 
tanglement theory that there exist two canonical, genuinely-entangled tripartite states over qubits, 
up to equivalence by stochastic local operations and classical communication [26 J. These states are 
the Greenberger-Horne-Zeilinger (GHZ) state and the W stateF] 

| GHZ) = -= (|000) + |111)) |W) = -jp= (| 100) + |010) + |001)) 

\/2 V3 



In section 8.2 we identify two properties shared by GHZ and W states, which we call strong 
symmetry and strong SLOCC-maximality. Strongly symmetric states are symmetric states that extend 
naturally to larger symmetric states on any number of systems. For instance, the N-partite versions 
of GHZ and W are defined as: 

|GHZ N ) := |00...0) + |ll...l) 

\W N ) := |10. . . 0) + |010 . . . 0) + . . . + |0 . . . 01) 

SLOCC-maximal states are states that are maximal with respect to conversion by stochastic local 
operations and classical communication. That is, |Y) is SLOCC-maximal precisely when any state 
| Y') that can be converted into | Y) by way of a SLOCC protocol must already be SLOCC-equivalent 
to |Y). Strongly SLOCC-maximal states are multipartite states that are inductively SLOCC-maximal. 
That is, | Y) is a strongly SLOCC-maximal N partite state if it is SLOCC-maximal and it is possible 
to obtain a strongly SLOCC-maximal (N — 1) partite state from |Y) by projecting out any of the 
subsystems H. 



1 W states were also first introduced by Greenberger, Horn, and Zeilinger in 1991 1 66 \, but they were not named until Diir, 
Vidal, and Cirac highlighted their significance in 2000 ]26|. It is generally believed that the W is for Wolfgang (Diir) |13|. 
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States that satisfy both of these conditions are called Frobenius states. In section 8.3 we show that 
any Frobenius state extends to commutative Frobenius algebra. Conversely commutative Frobe- 
nius algebras can be used to construct a Frobenius state. The commutative Frobenius algebra Q 
associated with the Frobenius state \GHZ ) is: 



9 = V2 



10} 



= |0)(00| + |1)(11| 
^ = |00)<0| + |11)<1| 6=V2(+| = (0| + <1| 

For the Frobenius state |W), the associated Frobenius algebra W is: 

y = ii> (ni + io> (oi| + |o> <ioi f = |i) 

X = |00)(0| + |01)(l| + |10)(l| i = (0| 

In [ 19], Coecke and Kissinger produced a unique characterisation of GHZ and W states in terms 



of properties of their associated Frobenius algebras, which is summarised in section 8.3.1 We 
highlight two types of commutative Frobenius algebras based on the value of the "loop map". 



SCFA 



o 



ACFA 



k 
? 



We prove that commutative Frobenius algebras are special if and only if their associated tripartite 
states are SLOCC-equivalent to GHZ. Similarly commutative Frobenius algebras are anti-special if 
and only if their associated tripartite states are SLOCC-equivalent to W. Thus, the two canonical 
tripartite qubit states can be distinguished by two simple graphical identities. 

Taking inspiration from the interaction properties of GHZ and W states, we define a GW-pair as 
a pair of commutative Frobenius algebras (one special, one anti-special) satisfying certain graphical 
identities. 



A=H Y = il 



O 



?? 



o 



u 



We prove that the axioms of a GW-pair subsume the axiomatisation for GHZ and W states 
given in ||T9l . Again inspired by the example of the GHZ and W states, we introduce the notion of 
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a distributive GW-pair. Distributive GW-pairs behave similarly to rings, in that the "multiplication" 
induced by XJ distributes, up to a scalar factor, over the "addition" defined by \f . 



(1.3) 



Using a GW-pair, we can construct the abstract analogue of a CNOT gate, and verify graphically 
that it behaves as a CNOT. For the specific GW-pair defined by \GHZ } and \W), this is actually a 
CNOT gate. Using this fact, we prove that the generators of the pair (Q, W) are universal for quan- 
tum computation. By Choi-Jamiolkowski, this means they can also be used to construct arbitrary 
multipartite qubit states. 

The precise sense in which the GHZ algebra behaves like multiplication and the W algebra 



behaves like addition is explained in section 8.4.3 Qubits defined on the Bloch sphere can equiva- 



lently be considered as points on the complex projective line CP . We can define (partial) addition 
and subtraction operations on the points in CP , considered as the set C with an additional point 
at infinity. 



k ■ oo = oo 


A: + oo 


= oo 


O-oo = _L 


O + oo 


= 00 


00 • 00 = oo 


00 + 00 


= _L 



The GHZ algebra corresponds to the multiplication operation on CP , and the W algebra corre- 
sponds to addition. For unitary elements, multiplication distributes over addition. The failure of 



distributivity for oo in CP (i.e. distributivity up to a non-zero scalar) is reflected in equation 1 1.3 1 
by the fact that k — when |fl) = |oo). 

There is still much to be learned about the algebras induced by quantum states, but already 
the compositional approach has yielded insights about the GHZ and W states that would not have 
been possible otherwise. One could picture quantum algorithms or protocols that leverage the be- 
havioural qualities identified in this dissertation. For instance, treating inputs to graphs consisting 
of T|r and XJ as variables, we can think of such graphs as encoding polynomials in quantum 
states. Work is in progress to apply this insight to the development of quantum algorithms for 
hard problems such as finding the roots of diophantine polynomials. 



In Part III we introduce Quantomatic and QuantoCoSy, which are software tools for working 
with string graphs. Quantomatic allows a user to create and modify string graphs, graphical theo- 
ries, and string graph rewrite systems. It also lets one selectively apply rewrite rules and normalise 
graphs with respect to a rewrite system. QuantoCoSy is a tool for synthesising new graphical theo- 
ries from concrete models using a technique called conjecture synthesis. This procedure, introduced 
by Johansson, Dixon, and Bundy in 2010 1 33 ], is a procedure for enumerating and checking equality 
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for all terms of a certain size in an algebraic theory. The thing that makes this technique so effective 
is it builds a rewrite system dynamically during the enumeration procedure and actively avoids 
checking for redundant equalities, i.e. those that are already derivable using the rules it has discov- 
ered previously. It does this by only enumerating terms that are irreducible with respect to a rewrite 
system. QuantoCoSy adapts this technique from term rewrite systems to string graph rewrite sys- 
tems, and is showing potential to be a valuable tool in the generation of graphical theories from 
concrete, linear algebraic models. 



In chapter 10 we review the major results of the dissertation and discuss future work, particu- 



larly in the area of automation. 
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Chapter 2 

Monoidal Categories 



It is often useful to reason in a very general sense about processes and how they compose. Category 
theory provides the tool to do this. A category consists of a collection of objects A, B, C, . . ., a collec- 
tion morphisms f, g, • ■ • , an associative operation o for (vertical) composition, and for every object 
A an identity morphism \ A . Objects can be thought of as types. They dictate which morphisms can 
be composed together. We shall primarily be interested in categories that have not only a vertical 
composition operation, but a horizontal composition as well. Such categories are called monoidal 
categories. 

Definition 2.0.1. A monoidal category consists of a category V, an object I 6 V called the monoidal 
unit, a bifunctor <g> : V x V — > V called the monoidal product, and natural isomorphisms oca,B,C '■ 
A <g> (B <g> C) -> {A®B)®C, X A : I ® A -> A, and p A : A <g> I -> A, such that Aj = pi and the 
following diagrams commute: 

A®(B®(C®D)) — ^ (A®B)®{C®D) — ^— ((A ® B) ® C) ® D 



A® a 



A®({B®C)®D) 



a<g)D 



(2.1) 



(A®(B®C))®D 



A® (I® B) 



(A® J)®B 




(2.2) 



We shall refer to (®, a, A, p) as the monoidal structure of V. We often drop a, A, and p when they 
are clear from the context. Monoidal categories where all three natural isomorphisms are actually 
equalities are called strict monoidal categories. 

Examples 2.0.2. The condition of being a monoidal category is very weak. Most categories of 
interest admit at least one monoidal structure, and many admit several. Some examples: 



18 



• (Set, x, 1): the category of sets and total functions with the cartesian product x and the one- 
element set 1 make Set into a monoidal category. 

• Disjoint union + and the empty set form another monoidal structure on Set. 

• More generally any category with finite products or coproducts is monoidal. 

• ( Vectx/ <8>, K): The category of K-vector spaces and K-linear maps is monoidal, with monoidal 
product taken as tensor product of vector spaces and tensor unit K, the 1-dimensional space. 

• (FVectjo 0, K): The same as above, but restricted to finite-dimensional vector spaces. 

• (Mat(K), <8>,1): The category whose objects are natural numbers and whose arrows M : m — > 
n are n x m matrices taking values in K. Composition is matrix multiplication, the monoidal 
product is multiplication of natural numbers (on objects) and the Kronecker product of ma- 
trices (on arrows). This category is essentially FVectx, with a chosen basis for all of its objects. 

• (Rel, x , 1): the category of sets and relations. Note that the cartesian product x is a monoidal 
product, but not a product in the categorical sense. 

• (Rel, ©, {}): where © is the disjoint union of sets (on objects) and the disjoint union of rela- 
tions (on arrows). We write the disjoint union using the © symbol to highlight the fact that it 
is actually a biproduct in Rel. As such, it is automatically a monoidal product. 

In any monoidal category, a, A, and p can be used to construct a natural isomorphism from some 
object to any other bracketing of that object, with or without monoidal units. E.g. 

(A <g> I) <g> (B <g> (I <g> C)) = (A <g> (B <g> (C <g> I))) 



It was shown by Mac Lane that the equations in Definition 2.0.1 suffice to show that any such 
natural isomorphism is equal to any other one [48]. Such a theorem is known as a coherence theorem, 
and it was the first of many concerning monoidal categories. By a minor abuse of notation, we shall 
often treat monoidal categories as if they were strict. That is, we often omit brackets, a, A, and p, 
simply assuming they are included where necessary. Coherence assures us that we can omit these 
details without ambiguity. 

Though we shall occasionally use normal, algebraic notation for morphisms in monoidal cate- 
gories, it is often vastly more convenient to use a graphical notation. In fact, the majority of this 
dissertation concerns formalising and exploiting graphical notation. For now, we shall treat this 
notation informally and fill in the details later. We represent objects as labelled wires: 
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Morphisms can be thought of as processes. A morphism takes something of type A and pro- 
duces something of type B. For that reason, we'll draw morphisms as boxes with a wire coming 
in labelled with a morphism's input type and a wire going out labelled with a morphism's output 
type. 



D 



Identity morphisms are special "do nothing" processes, which take something of type A and 
return the thing itself. We represent these as empty wires. 



Morphisms are composed by plugging an output wire into an input wire. 

















A 




B 


A 


f 


8 


o 


/ 


= 


B 




C 


B 


g 




C 



Implicit in this box-and-wire notation is the assumption that composition is associative. 

A 





















J 


( 






















c , 




B 




A \ 


B ( 


c 




B 


\ 




A 


h 


' 


8 


o 


f 


~ 


g 


- 


h 


o 


g 


° 


/ 




D 




C 




B 


C 


D 




C 


/ 




B 



...and unital. 







A 


A 


A 


B o 


/ J 


( = J 


C o 






B 


B 


B 



We express the monoidal product of two objects as juxtaposition of wires. 



A ® 
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.and the monoidal product of morphisms as the juxtaposition of boxes. 





A 


A' 


A 


A' 


/ ® g - f g 




B 


B' 


B 


B' 



The monoidal product is also associative and unital, but possibly only up to isomorphism. We 
denote the monoidal unit / as the empty graph. Note that the bifunctoriality of the tensor product 
is implicit in this notation. 

(g®g')o(f®f') = (gof)®(g>of>) 



/ /' 



/ /' 



/' 



Ic Tc 



/' 



Ic T* 



The following proposition is a simple consequence of binfunctoriality: 
Proposition 2.0.3. For any morphisms f : A — >■ B and g : A' — >■ B' in a monoidal category, 

(B®g)o(/®A') = {f®B')o(A®g) 



Proposition 2.0.3 can be interpreted graphically by "sliding boxes" past each other: 



/ 



A' 



u 



(2.3) 



/ 
Tb' ^ 

It was proved by Joyal and Street that planar box-and-wire diagrams can unambiguously rep- 
resent morphisms in a monoidal category. They showed furthermore than this representation is 
sound and complete with respect to the algebraic definition of a monoidal category. This and simi- 



lar results will be discussed at length in section 2.3.2 



2.1 Types of Monoidal Categories 

So far, we have introduced monoidal categories. These are sometimes referred to as planar monoidal 
categories, as diagrams of morphisms are always planar. However, many monoidal categories 
come with a notion of "crossing" wires. The weakest such category is called a braided monoidal 
category. 
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Definition 2.1.1. A braided monoidal category is a monoidal category (V, ®, I, a, A, p) with an addi- 
tional natural isomorphism ^ a,b '■ A<g> B ^ B <g> A called a braiding, such that p^ := A a o j A[ and 
the follow diagrams commute: 



7®C 



(A®B)®C 



(B®A)®C 



B®(A®C) 




B®(C®A) 



(2.4) 



A®(B®C) 



(B®C)®A 



7~ 1 <g)C 



(A®B)(E)C 



(B<g>A)<g>C 



B(g>(A<g>C) 




B® (C® A) 



(2.5) 



„-i 



A® (B<g>C) 



(B®C)® A 



The braiding 7 and its inverse 7 are drawn as wire crossings. Note how one wire is explicitly 
drawn over the top of the other. 





This is to emphasise that 7 may not be equal to 7 . That is, we cannot simply pass wires 
through each other in the graphical language. However, the naturality of 7 and the diagrams from 
Definition |2.1.1 suffice to prove any equation about morphisms in a braided monoidal category 
that we can prove geometrically with braid diagrams. In particular, 7 satisfies the Yang-Baxter 
equation: 
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We shall primarily be interested in a special case of a braided monoidal category called a sym- 
metric monoidal category. 



Definition 2.1.2. A symmetric monoidal category (SMC) is a monoidal category with a braiding a 

-l 
3, A' 



such that a j^ g = <r R 



When a braided monoidal category is symmetric, we refer to the braiding a as the symmetry 
map. To emphasise the fact that c^ g = <TZ\, we do not distinguish over- and under-crossings: 




Examples 2.1.3. All of the categories from Examples 2.0.2 are symmetric monoidal categories. 

• (Set, x, {*}), with u^b : A x B — s> B x A defined as the canonical swap map (a, b) n> (b,a). 

• (Set, +,{}), with u^ B : A + B — S> B + A the map that interchanges the two components of the 
disjoint union. 

• For any category with finite products, the projection maps induce a canonical symmetry map: 



AxB 




• The swap map is induced similarly in any category with finite coproducts. 

• ( Vectj-, Cgi, A;) is an SMC, with a the tensor swap map. I.e. it is the linear extension of oy w (v <8> 

w) =w®v. 

• (Rel, x , { * } ) is an SMC, with <r defined as the swap map of the cartesian product. 

We can interpret any progressive diagram (i.e. a diagram with no feedback loops) as a mor- 
phism in a symmetric monoidal category. Like in the case of planar monoidal categories, the axioms 
of a symmetric monoidal category ensure that there can be no ambiguity. 
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(h® F) o (E ® <r) o (g ® D) o f ( (h o a) <g> F) o (D ® g) o a o f 

The natural next question to ask would be, "Is there a meaningful way to interpret diagrams 
with feedback loops?" The answer to this question is yes. There are actually two meaningful ways 
to do this. The first is a traced category, and the second, which subsumes the first, is a compact closed 
category. 

Definition 2.1.4. A symmetric traced category V is a symmetric monoidal category with a function 

Tr x : hom v (A <g> X, B <g> X) -> homy (A, B) 
defined for all objects A, B, X, satisfying the following five axioms: 

1. Tr x ((g®X)ofo(7z<g)X)) =goTr x (/)o/z 

2. Tr Y (/o(A®^))=Tr x ((B®^)o/) 

3. Tr 7 (/) = / and Tr x ® Y (/) = Tr x (Tr y (f )) 

4. Tr x (g®/)=g®Tr x (/) 

5. Tr x (cr x , x ) = l x 

We refer to Tr x is the trace operation and X as the object being traced out. 



We depict this graphically by connecting the X-output of a map f : A <g> X 
X-input. 



B ® X to the 



Axioms 1 and 4 are implicit in this notation, since we do not draw the bounds of the trace 
operation. Axiom 2 is a "box-sliding" identity: 
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/ 



A 


8 


f 



A special case (A = B — I) of this axiom is the familiar property of matrix traces in linear 
algebra: Tr(MN) = Tr(NM), i.e. the value of the trace is not affected by cyclic permutations. 
Axiom 3 makes the trace operation respect the monoidal product on objects: 



/ 



I = / and 



X®Y = 




Axiom 5 allows us to pull out loops in the diagram: 




Examples 2.1.5. There are at least two ways in which the category Rel can be made into a symmetric 



traced category: one for each of the monoidal products defined from Examples 2.0.2 



• (Rel,®): For a relation R:AxX->BxX, we define a new relation Tr x (_R) : A — >■ B as 
follows: 

,X/ 



a 



Tr x (R) b<=>VxeX.(a,x)R(b / x) 



Thinking of a relation as a matrix over the booleans, this is analogous to the usual partial 
trace of a matrix. 

• The trace for (Rel, ©) was defined in [34]. Let R : A © X ->■ 6 © X be a relation. We define 
the trace as: 



a 



Tr x (R) b <=> (aRb) V (3x,x' 6 X . aRx A x'Rx A x'Rb) 



The new relation incorporates "feedback" from the X-output of R to the X-input of R via term 
"x'Rx" on the RHS. 

Example 2.1.6. The category FVect^ of finite-dimensional vector spaces and linear maps is a traced 
monoidal category with Tr given by the partial trace operation on a linear map. Suppose / : 
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A(g)X— >B(g)Xisa linear map. Then, by fixing bases x l E X, a 1 E A and b 1 E B, then f is uniquely 



determined by an indexed collection f/\ E K called a tensor. 

f{a i ®xJ)=Y J f{[b k ®x l 
k,l 

We can then define a new tensor by summing together the lower X-index with the upper X- 
index. 

k 
This new tensor defines a linear map from A to B: 

Tr X (/)K)=E/F 

; 

Equivalently for x, E X* the corresponding basis of the dual space of X, we can define the 
partial trace as: 

TrX (/) :=E(B«)x I ) / (^®^) 

i 

When A — B = K, this is just the usual trace of a matrix. With this is mind, we can see why 
FVectx is an example of a traced monoidal category, but Vectjc is not. For an infinite-dimensional 
vector space V, Tr (ly) is also infinite. In particular, it is not an element of hom(K, K) = K. 

Note how the dual space plays a role in the definition of the trace. The dual space is actually a 
special case of a general categorical notion called a dual. 

Definition 2.1.7. Let A and A* objects in a monoidal category. A* is called the right dual of A 
(equivalently, A is called the left dual of A*) if there exist maps d& : A <S> A* — > I, e& : I — > A* (g> A 
satisfying the "line-yank" identities: 

A* 





d A ®A 
e& is called the cap, and d& is called the cup, of the compact structure. 

In the graphical notation, the object A* is represented as a wire labelled A, but directed upward 
instead of downward. 



A := 



A* := 



We represent e/{ and d^ as half-turn of wire, forming a cup or a cap. 
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e A ■- 



The diagrams from Definition 2.1.7 are called the "line-yank" identities, because their graphical 
representations literally look like pulling a wire straight. 



Definition 2.1.8. A monoidal category where every object has a right (resp. left) dual is called a 

right (resp. left) autonomous category. 

Definition 2.1.9. A corn-pact closed category is a category that is right autonomous and symmetric. 
For an object A in a compact closed category the dual maps d A and e A are called a compact structure 
for A. 

Note that compact closed categories are automatically left autonomous. Any right dual A* oi A 
can be made into a left dual by choosing maps d' : = d o a and e' :— aoe. The left line-yank identity 
can then be derived from symmetry and the right line-yank. 




Also note that in a compact category and map f : A — > B can also be considered as a map 
f* : A* — »■ B* by using caps and cups to "bend the wires" around. 



/ 



/* 

Example 2.1.10. The category FVectj^ is compact closed. For any finite-dimensional vector space 

A, fix bases a' 6 A and a, 6 A* such that a> o a 1 = S,. Then, define a compact structure for A as 

follows: 

d A :: fl; (g> a) h-> (a, o a 1 ) e A :: 1 i->- Y^ a i ® fl! 

i 

Compact closed categories are automatically symmetric traced categories. A cap and a cup can 

be used to construct a "feedback loop" that acts as a trace operation. 



Tr x (/) := {B®d x )o{f®X*)o{A®e' x ) 



(2.6) 
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The axioms of a compact structure then suffice to prove the five trace axioms given in Definition 



2.1.4 Therefore, compact closed categories subsume symmetric traced categories. Often these 
categories are easier to work with than their traced counterparts, especially using the graphical 
language. It would be convenient to use the compact structure axioms when proving identities in 
symmetric traced categories. This is possible due to a result by Joyal, Street, and Verity. 

Theorem 2.1.11 ( 11341 ). Any symmetric traced category can be fully and faithfully embedded in a compact 
category. 

They prove this result by defining the free compact closure of a symmetric traced category, using 
a technique called the "Int construction". For a symmetric traced category V, they build a compact 
closed category Int(V) into which V embeds fully and faithfully. Thus / = g in V if and only if 
f — gin Int(V). This construction is also free over V, i.e. for a compact closed category V', a traced 
symmetric functor F : V — » V' extends uniquely to a compact closed functor F : Int(V) — > V'. 

We shall introduce one more type of monoidal category, introduced by Abramsky and Coecke, 
for the sake of reasoning about quantum information (3). 

Definition 2.1.12. A category C is called a t -category if there exists an identity-on-objects functor 

(-) + : C -> C°P such that ((-) + ) + = 1 C - 

In particular, t-categories are always isomorphic with their opposite category. As the notation 
might suggest, the t functor is an abstract version of the conjugate-transpose of a complex linear 
map. We can use it to define an abstract notion of unitarity 

Definition 2.1.13. A morphism (J in a t-category is called unitary if it is an isomorphism and U + = 

u-\ 

t-monoidal categories are simply monoidal t-categories, where all of the structural natural iso- 
morphisms are actually unitary isomorphisms. 

Definition 2.1.14. A t-symmetric monoidal category is a symmetric monoidal category where all 
of the components of a, A, and u are unitary. 

t-compact closed categories have the additional property that the t reflects caps and cups ver- 
tically. 

Definition 2.1.15. A t-compact closed category is a compact closed category where all of the com- 
ponents of a, A, and a are unitary and for all A, d A = e' A and e A = d' A . 

Examples 2.1.16. Several examples of t-compact closed categories are: 

• (Rel, x ), with t given by relational converse: aR f b ■£=> bRa 
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(Mat(K), (8>), with t given by transposition of matrices 

(FHilb, <S>), the category of finite-dimensional complex Hilbert spaces. The compact structure 
is the same as for FVectc and t is given by the adjoint of linear operators with respect to 



The category Hilb is a t-symmetric monoidal category but it is not traced (and hence not com- 
pact closed). 

For each of the categories we have introduced, we can form the category of (symmetric, traced, 
compact, ...) monoidal categories and suitably structure-preserving functors. 

Definition 2.1.17. For monoidal categories C, T>, a strong monoidal functor consists of functor 
F : C — > V an isomorphism cp : I — > F(I) and a natural isomorphism ipA,B '■ FA® FB — > F(A ® B) 
such that following diagrams commute: 



ip®FC 
(FA <g> FB) ® FC - F(A ®B)®FC 



FA® ip 
FA ® (FB ® FC) FA ® F(B ® C) 



1> 



FA® I 
FA®(p 
FA®FI 



FA 



4' 



HP 



F(A®I) 



-i\ 



I® FA 
$®FA 
FI®FA 



F((A®B)®C) 
F(«) 

F(A®(B®C)) 
A 



FA 



>P 



F(A-l) 



F(I®A) 



The adjective strong is used to distinguish from a /ax monoidal functor, where the isomorphisms 
are replaced by arbitrary maps. In the case of strict monoidal categories, this definition simplifies, 
as coherence diagrams need not be considered. 

Definition 2.1.18. For strict monoidal categories C, V, a strict monoidal functor is a functor F : C — > V 
where I = F(I) and F(A ®B) =FA® FB. 

For symmetric, traced, compact closed, and t-compact closed categories, we can make similar 
definitions, where the functor application must commute with all of the structure in sight. There are 
also strict versions of all of these categories, where the associativity and unit natural isomorphisms 
are all identities. 

Definitions 2.1.19. The following are all categories of (small) monoidal categories: 

• MonCat: the category of monoidal categories and monoidal functors 

• SymMonCat: the category of symmetric monoidal categories and symmetric monoidal func- 
tors 



29 



• SymTraceCat: the category of symmetric traced categories and symmetric traced functors 

• CCCat: the category of compact closed categories and compact functors 

• MonCat s , SymMonCat s , SymTraceCat s , CCCat,: the strict versions 

2.2 Free Monoidal Categories 

An important question for monoidal categories is as follows. 

Given a suitable description for the generators of a (symmetric, traced, compact closed, ...) 
monoidal category, can we generate the free such category? 

This question is important because two arrows f and g are equal in a free monoidal category if 
and only if their equality can be established only using the axioms of that category. We can make 
this precise by expounding on the usual universal property satisfied by free objects, but first, we 
define the notion of a monoidal signature, which defines the generators of a monoidal category. 

Notation 2.2.1. For a set O, let w(0) be the free monoid over O, i.e. the set of lists with elements 
taken from O where multiplication is concatenation and the unit is the empty list. For a function 

f :0 ->0', let w(f) : w(0) ->■ w(0') be the lifting of / to lists: 

w(f)([A,B,C,...]) = [f(A),f(B),f(C),...} 

Definition 2.2.2. A (small, strict) monoidal signature T = (0,M, dom, cod) consists of a set of 
objects O, a set of morphisms M, and a pair of functions dom : M — > w(0) and cod : M — > w(0). 

The maps dom and cod should be interpreted as giving input and output types to a morphism 
m E M. For instance, if dom(m) = [A, B, C] and cod(m) = [D], then m represents a morphism 
from A (g> B (g> C to D. The empty list is interpreted as the tensor unit J. 

Example 2.2.3. Define a monoidal signature T = (0,M, dom, cod) where 

= {A,B,C} 
M = {f,g} 

dom={f^[A,B],g^[C}} 
cod = {/ -> [C], g H- [C]} 

This signature defines three (primitive) objects A, B, C and two morphisms f : A ® B — » C and 
j : C ^ C. We will often write a signature as a set of boxes, representing the diagram generators: 



T:~- 



A B 

I I 


C 

I 


f 


, 


g 


C 


c 
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There is also a notion of a non-strict monoidal signature. In that case, w(0) is replaced with the 
free (®, I)-algebra over O. We treat the strict case for simplicity, but many of the results translate 
immediately, replacing equality with coherent natural isomorphism. 

Definition 2.2.4. For monoidal signatures S, T, a monoidal signature homomorphism f consists of 
functions fo ■ 0$ — >■ Oj and /m '■ M$ — s> Mj such that the following diagrams commute. 

cod s 



dorrts 
M s w(p s ) 



/m 
M T 



™{fo) 



w(Oj 



M s 
M T 



w(O s ) 

M/o) 
w(O t ) 



domj- codj 

MonSig is the category of monoidal signatures and monoidal signature homomorphisms. 

A monoidal signature is essentially a strict monoidal category without composition or identity 
maps. A monoidal signature homomorphism is thus a monoidal functor, minus the condition that 
it respect composition and identity maps. 

Definition 2.2.5. A monoidal signature is called simple if the images of dom and cod are restricted 
to single-element lists. 

There are evident forgetful functors from MonCat s , SymMonCat s , SymTraceCat s , and CCCat s 
into MonSig. If this forgetful functor has a left adjoint F, the image of a signature T under F is 
called the free monoidal category over T. 

To get a better feel for these objects, we unroll the universal property of the free category. Fix 
a monoidal category V. Then, for a monoidal signature T, a monoidal signature homomorphism 
from T to LT(V) is called a valuation. This homomorphism gives a value in V for each of the gen- 
erators in T. The universal property of the adjunction then guarantees there is a unique strong 
monoidal functor v : FT — S> V such that: 




In the non-strict case, v is only unique up to unique, coherent natural isomorphism. In (37|, 
Kelly and Laplaza gave a prescription for constructing the free category on any "algebraically- 
defined" additional structure on a category. They went on to describe concretely the free compact 
closed category on a category (or equivalently, a simple signature). Shum proved a similar result 
in 1994 [63J for tortile categories, i.e. braided monoidal categories with coherently-defined left and 
right duals. In the next section, we will discuss work to define the free symmetric, traced, and 
compact categories on an arbitrary signature, using graphical language. 
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2.3 Formalising Graphical Languages 

There are three ways in which one can formalise graphical languages for monoidal categories. The 
first formalisation is algebraic, where string diagrams are used as an equivalent representation of 
a tensor network defined using the abstract index notation introduced by Penrose in his 1971 paper 
[58 1 . The second formalisation is topological. In this approach, topological graphs (i.e. realisations 
of ID simplicial complexes) with added structure are used to represent morphisms [35]. The third 
formalisation is combinatoric. A special kind of typed graph called a string graph is used to repre- 
sent morphisms. This approach was developed by Dixon, Duncan, and Kissinger [22, 24J. In this 
section, we'll discuss the first two approaches. We'll discuss string graphs in detail in Partfi] 

2.3.1 Algebraic Approach: Abstract Tensor Systems 

In this section, we will look at the original formulation of string diagrams, due to Penrose in 1971 



Recall that a tensor is a set of real or complex numbers, indexed by one or more natural numbers. 
For example, the following is an (n\ ■ n?_ ■ Tridimensional tensor: 

{x k ij ■ i = i-«i; j = i-«2; k = i..n 3 } 

Subscripts should be thought of as inputs and superscripts as outputs. Familiar examples of 
tensors are vectors, v 1 and matrices, M.L We can compose tensors by contraction, i.e. "summing 
together " a lower index and an upper index of the same dimension: 

kl 
As expected, when we focus on vectors and matrices, we recover the usual notions of composi- 
tion and application of linear maps. In order to simplify such expressions, we can use the Einstein 
summation convention, where any repeated indices are assumed to be summed over. 

Penrose introduced abstract tensor systems to express generalised tensors and contractions. In 
many ways, this formalism resembles that of monoidal categories. Natural number indices are 
replaced with formal labels, which are simply names that can be used to identify inputs and ouputs. 
These are taken from a labelling set. 

C = {a,b,c, . . . ,a ,b , . . . ,a 1 ,b 1 , . . .} 

Vector spaces are replaced by sets of formal tensors. For two lists of labels U, L the set TP has 
as elements formal tensors. For L = {a$, . . . ,a m } and U = {bo,...,b n }, we write a formal tensor 
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using the usual tensor notationPJ 



b ,...,b„ T u 

Aa ,...,a„, c / 1 



It's useful to think of the sets TP- as something akin to a horn-set, whose elements £ are mor- 
phisms from X®I L (i.e. \L\ copies of X) to X®I U L Penrose defines four operations over sets of 
abstract tensors. 

Relabelling: R-.T^ ->■ Ty for L ^ L' and (J ^ U' 

Addition: + : T L U x T L U -> T L U 

Outer product: (g> : 7]F x 7^' -> 7^F' 

Contraction: C^ : T L U ->■ 7^Jj } 

These satisfy certain compatibility axioms (e.g. associativity and identity laws), mirroring those 
of normal tensor contraction. 

Remark 2.3.1. The usual, categorical composition can be defined in terms of outer product and 
contraction. 

/3 Z ?'-' Z '; o oil°'-t ■.= c y x y ] ■ ■ ■ c y 7{ocl°rt ® ¥> z Y"' Zn , ) 

If we then include for every pair of labels a Dirac delta tensor 5 a (i.e. an identity map), with 
suitable axioms, the data from an abstract tensor system defines a symmetric traced category. In 



chapter 5.5 we make use of (essentially) the converse construction to prove the main theorem. 

With the inclusion of raising and lowering tensors g a &, g a ' , it becomes a compact closed cate- 
gory with X = X* . For this reason, abstract tensor systems are widely regarded as the prototype 
for compact closed categories, in their modern formulation. 

As in the concrete case, we represent outer (i.e. tensor) product as juxtaposition, contraction by 
repeating an index, and let relabeling be implicit. 

Example 2.3.2. The following is an abstract tensor contraction, followed by its explicit form, in 
terms of the functions above. 

< b p d c+7i b --=c c c 'K, b ®rt)+7i, b 

However, even with this convention, contraction expressions can get quite complex. Consider 
this expression, involving six abstract tensors: 

<i/ f f h' di MA (2.7) 

In order to work with this expression, one has to keep track of 11 indices, which makes com- 
putations time-consuming and error-prone. To address this issue, Penrose introduced a second, 



1 Note that the data associated with the tensor includes a total ordering on the sets L and U. This is implicit in the use of 
lists of index names in the tensor. 
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graphical notation. Tensors are drawn as boxes, and contractions over pairs of indices as wires. 
The "identity" tensor (i.e. the Dirac delta $1) is also drawn as a wire. The non-contracted, or "free" 
indices are left as dangling wires, and contractions 6\ are represented as circles. In the graphical 
notion, expression ||2.7) becomes the following diagram: 




Such a diagram can always be interpreted, unambiguously as an abstract tensor network, up to 
a relabelling of indices. It is also clear how the data above forms a symmetric traced category. 

2.3.2 Topological Approach: Anchored Graphs 

In 1991, Joyal and Street formalised the graphical Penrose notation as a generalised topological 
graph, with some added structure [35]. They went on to prove that variations of these graphs 
could be used to construct the free planar, symmetric, and braided categories on a monoidal signa- 
ture. The usual notion of a finite topological graph is a Hausdorff space that forms the geometric 
realisation of a one-dimensional, finite simplicial (or equivalently, CW) complex. A generalised fi- 
nite topological graph is a finite topological graph that is allowed to have some "open ends". That 
is, some edges in the graph look like the half-open interval or the open interval. We'll drop the 
adjective finite for the rest of this section, and assume that all of the graphs are finite. 

Definition 2.3.3 (Generalised Topological Graph). A generalised topological graph is a pair (G, Go), 
where G is a Hausdorff space and Go is a finite subset of points in G such that G — Go is isomorphic 
to a sum of open intervals I : = (0, 1) C ]R and copies of Si. The points in Go are called vertices. 
The compactification of an open interval I = e C G — Go is called an edge e. A copy of Si CG-Go 
is called a circle c. If a subgraph of G is an edge or a circle, we shall call it a wire in G Let W(G) be 
the set of all wires in G. 

Example 2.3.4. A generalised topological graph. The points in Gq are marked as dots. 
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This gives a bit more information than just the graph itself. The set Go is used to identify all of 
the "logical" vertices in the graph, not just the the "topological" vertices (i.e. those points lacking a 
neighbourhood of [0, 1]). In particular, a vertex can occur along an edge. Also note that G need not 
be compact. Let G be the compactification of G obtained by freely adding endpoints to any open 
ends. Since (— ) is left adjoint to the forgetful functor U : CHaus — > Haus, the embedding of e in G 
uniquely fixes an embedding from e into G. 

Note that all edges naturally embed in the compactification G D G obtained by adding end- 
points to open edges. 

e <- 



G 



String diagrams are not just graphs, but directed graphs. Thus, more data needs to be added to 
impose directedness to the wires in the graph. These wires can either be edges (= [0, 1]) or circles 
(= S 1 ). In both cases, we have orien table manifolds, so we impose directedness by giving each of 
these manifolds an orientation. 

Definition 2.3.5 (Polarised Graph). A polarised graph is a tuple (G, Go, (o w ), (p V/ i)), where o w assigns 
each wire w E W(G) an orientation. We can therefore define an input e(0) and an output e(l) for 
each edge. For each vertex v E Go, in(c) is the set of edges such that e(l) = v and out(c) is the set 
of edges such that e(0) = v. For all v E Go, p v ,o is a total order on in(u) and p Vi \ is a total order on 
out(c), called a polarisation. A polarised graph that contains no directed cycles is called progressive. 

Polarised graphs come with a notion of boundary. We can furthermore put an ordering on this 
boundary. 

Definition 2.3.6 (Boundary of a polarised graph). For a polarised graph (G, Go, (p w ), (p v ,-)), 3G := 
G — G is a discrete space called the boundary of G. Points in dG that are the input of some edge are 
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called inputs of G, and outputs of edges in dG are called outputs of G. A polarised graph with a pair 
of total orders /3q, $\ on its inputs and its outputs respectively is called an anchored graph. 



Example 2.3.7. We can make the topological graph from Example 2.3.4 into an anchored graph 
T = (G, Go, (o w ), (p V/ i), (j6,)). We depict T diagrammatically as follows. G is drawn as before, but 
with the elements of Go shown as squares. The orientations provided by CO are depicted as arrow 
heads on the wires. The polarisation p v , is shown by ordering from left to right the inputs and 
outputs to each square. The orderings /3, are shown by ordering the inputs and outputs of the 
graph from left to right. 





4. < 4- < 1 

This is very close to representing the diagrammatic language given at the beginning of this 
chapter. The only thing missing is labels on the boxes and wires. 

Definition 2.3.8 (Valuation). For an anchored graph T = (G, Go, (o w ), {p v ,i)r (fti)) an d a monoidal 
signature T = (O, M, dom, cod), a valuation v of T is a function vq that assigns an element of O to 
every edge or circle in T and a function v\ that assigns an element m of M to every point in Go in 
such a way that respects the domain on codomain of m. 

An isomorphism of anchored graphs with valuations (T, v) — » (T r , v 1 ) is an isomorphism of gen- 
eralised topological graphs that respects orientations, polarisation, and input and output ordering, 
and is compatible with the valuations v and v' . 

Since an anchored graph gives a total order to inputs and outputs, we can associate input and 
output words to a pair (T, v). Let T = (0,M, dom, cod) be a monoidal signature. Fg(X) is the 
category whose objects are words in w{0). For words V and w, arrows are isomorphism classes of 
progressive anchored graphs with valuations into T that have input word V and output word w. 
Composition g o f is defined by plugging the outputs of some representative of the isomorphism 
class of f into the inputs of some representative of g, then taking the isomorphism class of the 
resultant graph. 
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Theorem 2.3.9. j|35f Fg(T) is the free symmetric monoidal category over T. 



In 1351 , Joyal and Street alluded to a sequel paper "The Geometry of Tensor Calculus, II" (GTC- 
II) in which they would prove analogous results for traced and compact closed categories. For 



various reasons, this paper was never completed. In chapter 5.5 we show that categories of string 
graphs can be used to construct the free symmetric traced and compact closed categories over 
a monoidal signature. This, along with the fact that the geometric realisation functor lifts to an 
equivalence of categories between the string graph-based free categories and the topologically- 
defined free categories [24] suffices to prove two of the missing GTC-II theorems: namely that 
suitably defined categories of topological graphs can be used to define (i) the free symmetric traced 
category and (ii) the free compact closed category over a monoidal signature. 
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Chapter 3 

Algebraic Structures in Monoidal 
Categories 



In a monoidal category V, one can define various algebraic structures internal to V. A standard 
example is that of a monoid in V. 

Definition 3.0.10. For a monoidal category V, a monoid in V is a triple (X, u : X ® X — >■ X,n : I — >■ 
X) such that the following diagrams commute: 



X® U 

X«X<»X * x«x 



?<®x 

X«)X 



/' 



X 




* X 



X®w 

X - x®x 



X 




(3.1) 



The diagrams in this definition establish associativity left unit, and right unit. 

Example 3.0.11. A monoid in the category (Set, x, {*}) is the monoid in the usual sense. That is, 
a group without inverses. X is the carrier set, y : X x X — »■ X is the associative multiplication 
operation, and v : {*} — >■ X is the map the picks out the unit, i.e. f]{*) = e. If we write \i{x,\j) as 



x ■ y, then the commutative diagrams from Definition 3.0.10 are equivalent to these equations: 

• (x-y)-z = x- (y-z), 

• e ■ x = x, and x ■ e — x. 

Examples 3.0.12. A monoid in (Vect^, <8),K) is an associative, unital K-algebra. Since y : V <g> V — > 
V is a linear map, then due to the universal property of the tensor product, y uniquely determines 
a bilinear map (—•—): V x V — S> V. Again, associativity and unitality come from Diagrams (3.1 1. 
Similarly, a monoid in (Ab, <8,N), the category of Abelian groups and group homomorphisms, is 
a ring. 
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We can also express a monoid graphically, as a triple (X, tit : X ® X 
Associativity, left unit, and right unit become the following equations. 



^ -Y V-l V- 



X, f : I -»■ X). 



(3.2) 



By simply turning everything upside-down, we define a comonoid in a monoidal category. 

Definition 3.0.13. For a monoidal category V, a comonoid in V is a triple (X,S : X — > X <g> X,e : 
X — } I) such that the following diagrams commute: 



X 



x®x 

<5<g)X 



X®X *■ X®X®X 

Xig> s 



Graphically, these axioms are: 




- X 




xV/x A = l A 



A commutative monoid is a monoid satisfying the following equation: 





Similarly, a cocommutative comonoid is a comonoid satisfying the same equation, but upside- 
down. 





3.1 Bi-algebras and Hopf Algebras 

Suppose we wished to define a group in a monoidal category. We begin by looking at how groups 
are defined in Set. A group is a monoid, with an additional unary operation ( — ) _1 such that 
x ■ x _1 = e = x _1 • x. How can we express these equations as commutative diagrams? This poses 
a challenge, because unlike in the monoid axioms, the free variable x in the inverse axiom features 
twice. Luckily, Set is not just a monoidal category, but a cartesian monoidal category. A cartesian 
monoidal category is a category with finite products, where the monoidal product is defined using 
the categorical product and the monoidal unit is the terminal object. In the case of Set, these are 
the cartesian product on the one-element set. 

In particular, cartesian monoidal categories always have a diagonal map for every object in- 
duced by the universal property of the product. 
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Tl\ Til 

The diagonal map can be used to "copy" the free variable ieX. 

Definition 3.1.1. A group object in a cartesian monoidal category (C, x, T), is a tuple [X,]i,r],i) such 
that (X, p., n) is a monoid and the following diagram commutes: 



Xx X 



Xxi 



->XxX 





X 



T 



X 





Xx X 



ixX 



- XxX 



where A : X — » X x X is the diagonal map and ! : X — » T is the unique map from X to T, the 
terminal object. 

The picture becomes clearer by looking at the graphical versions of these identities. Note how 
the terminal map is used to "delete" the free variable, just as the A map is used to copy it. 




We have defined a group object in a cartesian monoidal category but we have not quite an- 
swered the original question. Is there a notion of a group in a (non-cartesian) monoidal category? 
The maps A and ! in a monoidal category do not come for free. Instead, we require that they be 
part of the structure of the the "group" object. To justify what this structure should be, we highlight 
some properties of (A, !). 

Proposition 3.1.2. For any object X in a cartesian monoidal category, the triple (X, A, I) forms a comonoid. 
Furthermore, the following diagrams commute for all X, Y, and f. 

f A, 



X 
Ax 
XxX 



/x/ 



Y 

Ay 

- y x y 




XxYxXxY 



X x ax j x Y 



XxXxY xY 
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Proof. We write (g, h) : A — »■ B x C for the unique induced map to the product of the pair of maps 
g : A — »■ B, h : A — »■ C. We can prove associativity using the definition of A and some well-known 

properties of (— , — }. 

(X x A) o A = (7Ti,Ao 7i 2 ) o A = (tti o A,Ao 7t 2 o A) = (lx,Ao l x ) 
= (lx, (lx, lx» = ((lx, lx), lx) = (A x X) o A 

Right unit (and similarly, left unit) follows from terminality of !. 

(Xx! x )oA = (m, l x o K 2 ) o A = (7T 1/ ! X xx)oA 
= (7r 1 oA / ! XxX oA) = (1 X/ ! X ) =l x 

The copying property is proved as follows: 

Aof= {l Xf lx)°f= if J) = (/o7rioA,/o7T 2 oA) 
= (f° K\>f o 7T 2 ) o A = (/ x /) o A 

For the final property we establish that (A x x Ay) o (X x a X/ Y x Y) = (l x ,y, 1x,y)- For sim- 
plicity let Tt\, Tii be projections of the two-part products (-x-)x(-x-), and let n' v n' 2 , n' 3 , n\ be 
projections of the four-part products (• X • X • X • ) . 

U 



L X,Y 



Xx Y 



L x,r 



Xx Y 



A X X Ay 



7Ti X 7Q = (^,71^) 



XxXxY xY 



XigicT-® y 



Xx Y 

(7r;,7T 2 ) =7Tl 

XxYxXxY 




(^3,0 = 7r 2 



► Xx Y 



x,v 



n 



We already saw graphical versions of the comonoid axioms after 3.0.13 The remaining two 
diagrams from Proposition|3.1.2 , 



are: 





A x B 




Ax 
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In particular, Proposition 3.1.2|implies that A copies \i. 



V 



A 

A also copies the unit n. By terminality, }i also "co-copies" !. 




(3.4) 



A-!! 



I 1 (I 



(3.5) 



In a monoidal category, a monoid paired with a comonoid that behaves like copy and delete 
operations is called a bialgebra. 

Definition 3.1.3. In a monoidal category V, a bialgebra is a tuple (X, ji, n, 5, e) where (X, u, r\) is a 
monoid, (X, 5, e) is a comonoid and the following diagrams commute. 



X®X 



X 



x®x 



Xg>crg>X 
X <g> X «i X <8) X » X <g> X ® X ® X 




X®X 




These conditions are the same as equations < |3.4[ and 1 3.5 1, but with (X, A, !) replaced by an 
arbitrary comonoid. 



A^t y 



In the general case, it is worth noting that the comonoid within a bialgebra acts like a copy and 
deletion operation relative to the associated monoid, rather than globally. This distinction applied to 
the category of Hilbert spaces has deep connections to the no-cloning theorem for quantum mechan- 
ics. See, for example [1]. In a cartesian category, any monoid can be made into a bialgebra using the 
comonoid (X, A, !). 

In the general case of monoidal categories, the inverse map is replaced with an arbitrary map, 
called an antipode. The analogue to a group object is a Hopf algebra. 

Definition 3.1.4. A Hopf algebra is a tuple (X,]i,rj,S,e,i) such that (X,ii r n,5,e) forms a bialgebra 
and the following diagram commutes: 
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X®X * X<S)X 





X I X 

5 \ / ¥ 

x®x - x®x 

i®X 
Example 3.1.5. For any group G, we can form the group algebra (K[G], }i, rj) as follows. Let K[G] 
be a vector space in Vect^ spanned by the elements g E G. \i is then defined on basis vectors by the 
group multiplication: u{g®h) = gh. 7/(1) = e, the group identity. This clearly forms a monoid in 
Vect^. We can turn this monoid into a bialgebra by adding maps that "copy" and "delete" the basis 
of group elements. We can make it into a Hopf algebra by adding a linear map that sends elements 
of the group basis to their inverse. 

S::g^> g®g 
e::g^\ 

i"-g^ g~ l 

The Hopf algebra (K[G],u,t],S,e,i) captures all of the structure of the group algebra K[G] with- 
out relying on "global" copy an deletion operations, which are not present in the (non-cartesian) 
monoidal category Vect^. 

Examples 3.1.6. Here are a few more examples of Hopf algebras. 

• Any group object in a cartesian monoidal category is automatically a Hopf algebra, with 
5 — A and e —I. 

• The universal enveloping algebra of a Lie algebra U(g) has a natural Hopf algebra structure, 
given by the unique extension of the following maps on g to U(g): 

S(x) = (x <g> 1) + (1 (8> x) e(x)=0 i(x) = -1 

• More generally, quantum groups of the form LTq(g) for q EC carry a Hopf algebra structure. 

3.2 Frobenius Algebras 

Associative algebras {A,}i,rj) in a compact closed category of vector spaces always come with two 
canonical right representations: one over A and one over the dual space A*. Frobenius algebras 
are associative algebras that are self-dual, i.e. these two representations are isomorphic. Frobenius 
algebras can be defined in any monoidal category. In order to talk about representations abstractly, 
it is convenient to use the (equivalent) language of modules. 
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Definition 3.2.1. For a monoid A = {A,ji,t]), a right „4-module (M,m) is a morphism m : A 
M — )■ M such that the following diagrams commute. 



A®A®M >- A®M 



fi® M 



A®M 



t 
M 



rj®M 

M ► A®M 



M 



Left ^l-modules are defined analogously, and for commutative monoids, the two concepts are 

equivalent. Graphically, the right module equations look like asymmetric versions of associativity 

and unit laws. 

A A M A A M MM 

MM M 

Definition 3.2.2. For a monoid A = [A,}i,r]), and right „4-modules (M,m) and (N,n), an A- 
module homomorphism is a morphism (p : M -^> N such that: 

A(g>cp 
A®M A®N 





M 



M 



N 



Module homomorphisms pass through the module map. 



M 



<P 



n) N 
N 




(3.6) 




Monoids in a monoidal category have a canonical right A -module, namely the monoid itself. 

A A A A A A 

\j vy 

Y Y 

A A 

This is called the regular right A-module. In a compact closed category, a monoid comes with a 
canonical right module over its dual object }i : A ® A* — > A*. 

p. := (A* ® d) o (A* ® u ® A*) o (e ® A ® A*) 

We shall call this the dual right A-module. Graphically: 
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Proposition 3.2.3. (A*, p.) is a right A-module. 

Proof. We can show the two module conditions are satisfied graphically. 



y/ 

Associativity: w' 









Unit: 



□ 

Frobenius is generally credited with being the first to study the class of algebras whose regular 
and dual modules are isomorphic. This is a particularly important aspect of many associative 
algebras, including algebras of the form K[G] for some finite group G. The formal definition is due 
to Brauer and Nesbitt lITTTl . An abstract version is provided here. 

Definition 3.2.4. A Frobenius algebra in a compact closed category is a monoid (A, ]i,r\) equipped 
with a module isomorphism % : (A,fi) — > (A*,fi). 

The existence of a Frobenius algebra on an object A implies that A is self-dual, and furthermore, 
the monoid structure }i, rj is compatible with that self -duality. Graphically, we represent % and its 
inverse x 1 as follows: 



x--- 



\ *- - \ 



Graphically, the axioms of a Frobenius algebra are the monoid axioms given by 1 3.2 1 and three 
additional identities: 



v-y 



(3.7) 



Proposition 3.2.5. Let G be a finite group. Then, the associative algebra K[G] is Frobenius. 

Proof. Let (K[G], j , f ) be the associative algebra of the finite group G. For g E G, let g be the 
unique map defined onh E G as: 



m 



1 i£h = g 
otherwise 



45 



For x, let x(g) — 8 1 - Since K[G] is finite-dimensional, this extends to an isomorphism K[G] = 
K[G]*. It only remains to show that ^ is a module homomorphism. It suffices to show that the two 



sides of equation 1 3.6 1 agree on all basis elements gi 6 G. 



81 



g2 



V 



81 



Si 

^7 



S3 



Si 




S3S1 



S3 



S3 



Si 



Si 



1 ^S3Si=Si 1 
otherwise 



Si 



S2 



X-Z 



Si 



Si 



SiSi 



S3 



S3 



S3 



otherwise 



The proof is completed by noting that g^gi = g 2 1 if and only if g\gi = g 3 1 . 



a 



There are actually many equivalent definitions for a Frobenius algebra. Two additional defini- 
tions replace the isomorphism condition given above with a non-degeneracy condition. 

Definition 3.2.6. A map of the form O : A ® A — >■ I is said to be non-degenerate if the following 
map is an isomorphism: 




Equivalently there exists a unique map <£>' : I — S> A (g> A such that: 



M 



M 



Theorem 3.2.7. The following are equivalent. 

1. {A,u,n,x)isa Frobenius algebra, 

2. {A, u, n) is a monoid with non-degenerate map <£> : A (3 A — > I that is associative with respect to the 
monoid: 




3. (A, u, n) is a monoid with a map e : A — s> I such that the following is non-degenerate: 
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4. and {A, pi, n) is a monoid, (A, 5, e) is a comonoid satisfying the Frobenius identity: 





Proof. (1 =$■ 2) Define <J> in terms of ^ as: 



The associativity of O follows from the definition of dual module and the module homomorphism 




identity in equation 1 3.7 1 





v 



(2 => 3) Let e be defined as: 



1-U 



It can then easily be shown that e o fi = O, which is non-degenerate. 
(3 => 4) Let <J>' : I —> A <g) A be the unique map such that: 





We can show this induced cap commutes horizontally with the multiplication: 









Define 5 as follows: 



(3.8) 

Then it follows from the monoid identities on (A, ]i, n) that (A, 5, e) is a comonoid and that the 
Frobenius identity holds. 

(4 =$> 1) Finally let the isomorphism x an d its inverse be defined as follows: 
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It follows from the Frobenius identity and unit laws that this is indeed an isomorphism. 





The only thing the remains to be checked is that this is an isomorphism of modules: 





fY 



fW=V 



r 



□ 



The equation from definition 4 first appeared in a paper by Carboni and Walters 1 14J in the 
context of categories of relations, though it was only later discovered to be a characterising equation 
for Frobenius algebras. This presentation is particularly interesting because, like in the case of 
bialgebras, it consists of (1) a monoid, (2) a comonoid, and (3) a rule for how they interact, called a 
distributive law. In definition 4, a Frobenius algebra consists of a monoid and a comonoid. Therefore 
a Frobenius algebra could be commutative or cocommutative. While these might seem like distinct 
conditions, it turns out they are the same. 

Theorem 3.2.8. For a Frobenius algebra (A,ji,n,S,e), (A,p,t]) is a commutative monoid if and only if 
(A, 5, e) is a cocommutative comonoid. 

Proof. First, we use Equation ||3.8} to derive another form for 5 in terms of }i. 



A y Yt 




Cocommutative then trivially follows from commutativity: 
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The opposite implication is the same proof, upside-down. 



□ 



It is worth noting that this is not the case for bialgebras or Hopf algebras. As a simple counter- 
example, consider any non-commutative monoid in a cartesian category. The cocommutative 
monoid (A, !) automatically makes this into a bialgebra. 

Example 3.2.9. Revisiting the group algebra example, we can define the rest of the Frobenius alge- 
bra structure as follows. 

gl,g2^G,gig 2 =g 

e = e 



The comultiplication can be though of an averaging operation. It takes a group element to a sum 
over all of its possible factorisations. 

Example 3.2.10. Any compact closed category automatically has a Frobenius algebra on A* (g) A 
given by: 



F-~- 




"■-- 



5:-- 




£ :~ 



In a t-monoidal category, we can introduce the notion of a t-Frobenius algebra, where the 
monoid structure is just the dagger of the comonoid structure. 

Definition 3.2.11. A Frobenius algebra (A, u, n, S, e) is called a t-Frobenius algebra if u — S f and 



As we have already seen in the proof of Theorem 3.2.7 the Frobenius identity implies that the 
Frobenius "cap" and "cup" maps form a compact structure. 





In fact, the Frobenius algebra induces a self-dual compact structure, i.e. A* — A. Using this 
compact structure, we can define a transposition operation ( — ) • T relative to a particular Frobenius 
algebra. 











o 












/• T 


= 


/ 












J 


■J 







(3.9) 



49 



The situation here is a bit delicate. Whereas in most categories there is a canonical choice of 
a compact structure when A and A* are distinct objects (e.g. a vector space and its dual space), 
defining a compact structure when A and A* are the same object often involves a choice. Different 
Frobenius algebras defined on a single object A will often define different compact structures. It 
is a well-known fact that for finite-dimensional vector spaces, there is no canonical isomorphism 
connecting a space to its dual space. Picking a self-dual compact structure corresponds to choosing 
a particular isomorphism A = A*. 

The distinction between compact structures, where duals are defined up to isomorphism, and 
(non-canonical) self -dual compact structures is worth bearing in mind, particularly with regards to 
the Frobenius algebras defined in Part In] However, the situation is simpler when it comes to the 



trace operation. In Equation 2.6 we showed that any compact structure can be used to construct 
a trace operation. It turns out that the compact structures generated by Frobenius algebras (and 
more generally, any compact structures) always generate the same trace. 



I T 


r/l 


n 

1 u 



In the case of commutative Frobenius algebras, we usually represent the trace without a twist 
on the cap: 



A 



T 

Often it is useful to consider maps that can pass freely through the Frobenius algebra structure. 
That is, we consider maps that are module endomorphisms of the regular left and right A modules. 
By analogy to phase gates in quantum circuits, we call such maps phases. 

Definition 3.2.12. For a Frobenius algebra A = (A, \i, n, 5, e), a map <p : A — > A is called a phase for 
A if: 



X 


I j 


x 




V 




Y~ 




" Y 



We shall see the connection between abstract phases and quantum phase gates in sections 7.2 
and 1731 
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Proposition 3.2.13. A phase for a Frobenius algebra A is also a comodule endomorphism for 5, considered 
as a left and right comodule. 









i 








A- 


<p 


- A 




<p 


f \ 


<P 






i 


i 






i 





Proof. The proof follows straightforwardly from the Frobenius identities. 



I 

m - 


^, 


1 
#* - 


\J 




A 


A \ \ 






T 
<P 

i 




/ V 

<p 



The left comodule identity follows similarly. □ 

There is a canonical phase associated with any Frobenius algebra called its loop map. 



Proposition 3.2.14. For a Frobenius algebra A = (A, ]i, r\, 5, e), the loop map is a phase for A. I.e.: 





Proof. The proof follows from the Frobenius identity. 






□ 

For commutative Frobenius algebras, phases can always be expressed as a left (or right) multi- 
plication by a point. 

Proposition 3.2.15. All phases for a given commutative Frobenius algebra are of the following form: 

IA 



Proof. A map of the above form is clearly a phase. Let cp : A — >■ A be an arbitrary phase map. Then: 



51 





a 



3.2.1 Normal Form for Frobenius Algebras 

We shall primarily be interested in commutative Frobenius algebras (CFAs). The primary purpose of 
this section is to show that CFAs have particularly nice normal forms. To do that, we will state and 
prove the so-called spider theorem. First, we introduce the notion of a spider. 

Definition 3.2.16. For a commutative Frobenius algebra A, a spider is a map SJJj defined as follows. 



S° - 



rn+1 .- 



I 



S 1 •- 



T s ? : = I s i : = 



i - 1 



cn 

^ in 






cn 
°m+l ■' 



-l¥ 



cn 

I ■■■ i 



We represent the maps SJJj as single dots, with m in-edges and n out-edges. 

i - I 



cn 

J m 



^~ r7 ^~r 



Since a CFA is a commutative, associative, cocommutative, and coassociative, we can freely 
interchange edges. 

Examples 3.2.17. Some spiders of various arities: 

-I H A-A v^y 






As a minor technical point, it is occasionally necessary to distinguish morphisms in a monoidal 
category from their actual representations as string diagrams. For this section, we shall use D, D', . . . 
to represent formal string diagrams and \D\, \D'\, . . . to represent the associated morphisms in a 
monoidal category. 

Definition 3.2.18. For a commutative Frobenius algebra A = {A,u,n,5,e), an „4-diagram D is a 
string diagram whose vertices are all labeled \i, rj, 5, or e. An A-tree is an *4-diagram that contains 
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no cycles. An „A-tree that is formally equal to the diagram of S m for some m, n is said to be in 

spider-normal form. 



Lemma 3.2.19. Suppose D is a connected A-tree with m in-edges and n out-edges. Then \D\ 
the value of an A-tree is uniquely determined by its input and output arities. 



SI. he. 



Proof. We proceed by induction on the number of vertices in D. Note that any one-vertex „A-tree 
is trivially equal to a spider Sj, Sj, S\, or S\. Thus, assume for an „A-tree containing N vertices, 
\D\ = SJJ,. We show that for an .A-tree D' containing N + 1 vertices, \D'\ = S n ,. Since a spider 
is commutative and cocommutative, we can assume without loss of generality that the additional 
vertex is composed on the rightmost leg above or below the spider. These cases follow trivially 
from associativity, unitality and the definition of spiders: 







The only cases remaining are post-composition by u and pre-composition by 5. First, consider 
pre-composition with 5: 




If m =0, this is already in spider-normal form. So, consider the case where m > 1. By defi- 
nition of SJJ,, we can pull out a multiplication. Applying the Frobenius identity and the induction 
hypothesis: 





m+l 



I.H. 




We complete the proof by applying the same method upside-down for post-composition with 





m+l 



I.H. 




«+l 



□ 
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Because of this lemma, we could have equivalently defined S" n in Definition 3.2.16 as S", = \D\ 
for any ^4-tree D with m inputs and n outputs. With the help of a normal form result for ^4-trees and 



Proposition 3.2.14 we are ready to state a general normal form result for commutative Frobenius 
algebras. 

Theorem 3.2.20. For a commutative Frobenius algebra A, any connected A-diagram is equivalent to one 
of this form: 




Proof. We first prove a small identity relating to traces: 





6 



(3.10) 



Using the axioms of a compact closed category, we can deform any ^4-diagram into an *4-tree 
with traces. 




Applying Lemma 3.2.19 we have: 
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We can then turn each of the traces into a loop map, using equation [3.10), then push the loops 



up using Proposition 3.2.14 




Repeating the process for each of the traces, we obtain a diagram in normal form. 

3.2.2 Special and Anti-special Commutative Frobenius Algebras 

For any monoid in a traced category, we can define a map k : A (g> A — S> I: 



□ 



k :- 




(3.11) 



In FVectjo any map from A <8> A to the base field K is called a bilinear form. The map given by 
(3.11 is a particularly important bilinear form called the Killing formn This form plays a partic- 
ularly important role in the representation theory of an algebra. For an algebraically closed field 
K, the Killing form of a finite-dimensional associative K-algebra is non-degenerate if and only if 
that algebra is semisimple. Since the Killing form automatically associates with the multiplication, 
any algebra with a non-degenerate Killing form (i.e. any finite-dimensional semisimple algebra) is 
automatically Frobenius. However, the converse is not true. There are many interesting Frobenius 
algebras that have degenerate Killing forms. First, note that we can relate the rank of the Killing 
form of a commutative Frobenius algebra with that of the loop map defined in the previous section. 



rank 




rank 




rank 




We shall primarily focus on the minimal and maximal cases of this rank: the cases where the 
loop map is full-rank or rank one. It is never rank zero, except in the case of the zero-dimensional 
space. We wish to abstract the notions of full-rank and rank one to an arbitrary category. Clearly 
full-rank maps are just isomorphisms. A linear map is rank-one if any only if it can be factored 
through the base field: 



1 The Killing form is much more commonly defined for Lie algebras than for associative algebras. The Killing form of an 
associative algebra is sometimes called simply its trace form. 



55 



A 




K 



B 



We therefore define disconnected morphisms as an abstraction of rank-one linear maps. 

Definition 3.2.21. A morphism f : A — > B in a monoidal category is called disconnected if it factors 
through the tensor unit. 



T$ 



For a Frobenius algebra, having an invertible loop is the abstract analogue to being semisimple. 
In particular, we shall consider commutative Frobenius algebras whose loop map is equal to the 
identity. 

Definition 3.2.22. A special commutative Frobenius algebra (SCFA) is a commutative Frobenius alge- 
bra A = (A, u, rj, 8, e) such that }io S = 1 A . Graphically: 



(3.12) 



This is not a great loss of generality, given the following theorem. 

Theorem 3.2.23. For any commutative Frobenius algebra A = {A, ]i, rj, 5, e) such that p Sis invertible, 
there exists an invertible phase L such that (A, u, rj, 5 o L _1 , e o L) is an SCFA. 

Proof. Since A is semisimple, the loop map is invertible. Therefore, let L = y. o $. Module en- 
dormorphisms are closed under inversion, so since L is a phase, L is a phase. For, we show 
(A, S o L -1 , e o L) is a comonoid: 
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The Frobenius identity: 
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Specialness follows by definition of L. 



I 


I 


L- 1 

4 


L- 1 

= 1 - 
L 


u 


I 


I i 



a 

It is also worth noting that for special Frobenius algebras, defining just the monoid part (or just 
the comonoidal part) is enough to fix the entire structure. This is because the partial trace of \i is 
equal to e. 



v:i -6-1 



By condition (3.) of Theorem 3.2.7 (A, u, n, e) is enough to define the entire Frobenius algebra. 



Example 3.2.24. Fix a basis e, for a finite-dimensional vector space V. We can define the e, to be the 
vectors "copied" by 5 and "deleted" by e. 



S :: e,- h- >• e, ® e; 



e :: e,- h- > 1 



This clearly forms a cocommutative comonoid. We can complete the Frobenius algebra by letting 
the e, be a basis of idempotents of pi. 



]i :: e, ® e,- h-> e,- 



?/::E e ' 



This forms a commutative Frobenius algebra, and by definition, jiS = ly. We can do this for 
any basis e,-, and its a well-known fact that any semisimple commutative algebra {A, ]l,r\) over an 
algebraically closed field has a basis of idempotents, summing to rj, so in particular, all SCFAs are 
of this form. 

Now, we consider the other extreme: the case where the loop map is disconnected: 



(3.13) 



Due to a theorem by Herrmann [30|, we can actually obtain an explicit form for equation 13.13) 



Theorem 3.2.25. Let Abe a commutative Frobenius algebra with a disconnected loop map. Then the fol- 
lowing equation holds: 
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o 






Proof. Assume the following equation holds, for any maps x : A — »■ I,y : I — » A: 



Then 




U $ 

n $ 






a 



A commutative Frobenius algebra with a disconnected loop is called anti-special. 

Definition 3.2.26. An anti-special commutative Frobenius algebra (ACFA) is a commutative Frobenius 
algebra such that: 



o 



h 
? 



In addition to having a unit and a counit, anti-special Frobenius algebras have canonical dis- 
connecting points which we shall refer to as the anti-unit and anti-counit. 

Definition 3.2.27. For an ACFA, the anti-unit if and the anti-counit e are defined as follows: 



n 



? - f:\ 



e : = 



k 




Special and anti-special Frobenius algebras have well-behaved normal forms. 
Theorem 3.2.28. For an SCFA S, any connected S-diagram is equivalent to a spider. 



Proof. Any connected 5-diagram is equivalent to one in the form given in Theorem 3.2.20 We can 
then use equation 1 3.12 1 to remove all of the loops. □ 



Lemma 3.2.29. For an ACFA A, the comultiplication map copies the anti-unit and the multiplication map 
copies the anti-counit. 
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o 



o 




Proof. Follows straightforwardly by an application of Theorem 3.2.20 and the anti-specialness con- 
dition. 



o 



The upside-down equation is proved similarly. 




^ 



-? 




□ 



Theorem 3.2.30. Suppose Qi is an invertible scalar. For an ACFA A, any connected A-diagram is equiv- 
alent to one of the following, for scalar map k : I — > I. 



a.) 



(ii.) 




(Hi.) 



H 






Proof. If a connected *4-diagram contains no loops, it is equivalent to a spider by Theorem 3.2.20 
An ^4-diagram containing more than one loop is equivalent to a scalar multiple of two disconnected 
diagrams containing one loop each. 






Q O 



(3.14) 



? ? 



Similarly a diagram containing just a single loop can be made into two disconnected diagrams 
containing single loops. 










(3.15) 



If the diagram has no inputs or outputs, it is in the form of (L), so assume it has at least one 
input or output. In that case, any diagram equivalent to the RHS of 1 3.14} or 1 3.15) can be put into 
the form of (iii.) using Lemma 3.2.29 □ 
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Parti 



Graphical Languages and Rewriting 
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Chapter 4 

Rewrite Systems 



Rewrite systems provide a model of computation that is particularly well suited to formalising 
dynamical systems, computing algebraic identities, and constructing proofs by automated or semi- 
automated means. The most well-studied type of rewrite systems are term rewrite systems. A term 
rewrite system consists of a set of generators (i.e. symbols with arities), variables, and rewrite rules 
between terms formed from generators and variables. 

Term rewrite systems have applications in the study of programming languages, computer al- 
gebra systems, automated theorem proving, and many other areas of theoretical computer science. 
Rewriting terms is essentially the same as rewriting trees, and it was shown that many of these 
"tree" rewriting techniques could actually be applied to arbitrary graphs, or even objects of more 
general categories. 

In this chapter, we shall review some of the basic principals of rewrite systems and the double- 
pushout approach to graph rewriting. We will then illustrate how the DPO approach is always 
well-defined in a particular class of graph-like categories called adhesive categories. However, to 
define a category suitable for rewriting string graphs, we shall need a weaker notion than adhe- 
sivity. For that reason, we introduce partial adhesive categories, and show how these categories 
inherit "enough adhesivity" from their ambient adhesive categories to do DPO rewriting. 

4.1 Term Rewriting 

Definition 4.1.1. A term signature £ = (G, a : G — > N) consists of a set G of generators and a function 
a assigning each generator an arity 

We define the set of terms for a signature and a set of variables recursively. 

Definition 4.1.2. For a term signature £ = (G, a) and a set X of variables, we can form the set 
X(E, X) of terms as follows: 

• For all i £ X, i is a term. 
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• For g £ G such that a(g) = n and terms t; 6 T(S, X), g(ti, ■■■An) is a term. 

Variables are place-holders for other terms, i.e. other elements of T(E, X). The mechanism by 
which variables are assigned values is called substitution. 

Definition 4.1.3. For a set of terms T(E, X), a function <j : X — > T(£, X) is called a substitution. It 
can be lifted to a function & : T(L, X) — »■ T(L, X) as follows. For a term £ 6 T(E, X), replace every 
occurrence of any variable x E X in the term with o~{x). The resulting term is &{t). 

A rewrite rule is a pair (/, r) E T(S, X) x T(E, X), usually written / — > r. A rewrite rule can be 
used to rewrite a term t E T(E, X) to a new term t' £ T(E, X). This occurs in two stages. 

1. Matching: A substitution a is chosen such that cr(l) occurs as a sub-term of t. 

2. Replacement: The occurrence of &(l) in t is replaced by &(r). 

We can elaborate on this process a bit. Terms are essentially just trees: 



((x + lHy + z)) 



For a given tree, a subtree can be uniquely identified by its lexicographic position. The lexico- 
graphic position of a sub-tree t' of t is simply a list of natural numbers p — [cq, C\,..., c n _j]. The 
root of f' can then be located by taking the Co-th child of the root vertex of t, the Ci-th child of that 
vertex, and so on until C n -\. In other words, if we represent a term t as a list of lists, 

f = t[c ][c 1 ]...[c n .i] 

Thus, in the case of term rewriting, finding a matching / in / — > r is simply a case of identifying 
a substitution and lexicographic position such that 

t[co][ci] ■ ■ ■ [c„_i] =cr{l) 

Performing the rewrite is then just a case of replacing the subtree at that position: 

t[c ][ci]...[c„_i] :=&{r) 

A set 1Z of rewrite rules is called a rewrite system. We write t — c> -ji t' there exists a rule / — e> r E 1Z 
that rewrites t into t' using the above procedure. This forms a relation — i>ji C T(L, X) x T(E, X) 
called a reduction relation of 72.. It is not always the case that the LHS of some rule in 1Z will match 
a given term t. If there is no such matching, t is called irreducible. Otherwise, it is called reducible. 
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Example 4.1.4. Consider the algebraic theory of unital rings. This theory has two binary operations 
( — | — ) and (— ■ — ) as well as two 0-ary operations (i.e. constants) 1 and 0. The usual ring axioms 
can be turned into a rewrite system by directing each of the equations. 



1. ((x + y)+z)- 


-^>(x+(y + z)) 


2. ((*-y)-z)-t 


.(x-(y-z)) 


3. (x + 0) — > x 




4. (0 + x)^> x 




5. (x ■ 1) — > x 




6. (1 • x) — > x 




7. (x-(y + z))- 


■>((*-y) + (*-z)) 


8. ((x + y)-z)- 


^>((x-z) + (yz)) 


We can define a 


term 



f:=(0 + (((l+y)+y)-(x + z))) 
Let / — > r be rule 8 from above. To find a matching of I on t, first define a substitution: 

u :: {x I— 7- (1+y), y i— >• y, zi— s> (x + z)} 

Apply the substitution to / and r: 

^(/) = (((l+y)+y)-(x + z)) 

fr(0 = (((i+y)-(* + z)) + (y •(* + *))) 

Note that (j(/) now occurs as a subterm of f. 

t = (0 + 8-(/)) 

We form a new term t' by replacing the occurrence of &{l) with cr(r). 

f / = (0 + ^(r)) = (0+(((l+y)-(x+z)) + (y(x + z)))) 

— >7j is used to represent the reflexive, transitive closure of — o-n, and < — >ji the reflexive, 
transitive, symmetric closure. In the case where 1Z represents the "directed version" of the axioms 
of an algebraic structure, it is often the goal to evaluate the truth of the following proposition: 



'R 



t' 
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Or, "Is t equivalent to t' by the axioms of HP." As this is a statement of the word problem, it is 
not decidable for arbitrary rewrite systems 1Z. However, there are many "good" rewrite systems, 
where this proposition is decidable. Two characteristics of a rewrite system that make it "good" are 
termination and confluence. 

Definition 4.1.5. A rewrite system 1Z is said to be terminating if there exists no infinite chain of 
rewrites: 

t\ — > t 2 — > . . . — > t„ — > . . . 

In practice, one often proves termination by identifying a reduction order on terms. 

Definition 4.1.6. A partially ordered set (P,<) is called well-founded (or Noetherian) if it has a 
smallest element and contains no infinite sequence of strictly decreasing elements: 

pi > p 2 > • • • > Pn > ■ ■ ■ 

Well-foundedness is usually defined by every non-empty subset P' C P having at least one 
minimal element (i.e. an element that is not strictly greater than any other). Up to the Axiom of 
Choice, these two definitions are equivalent. Intuitively, a well-founded poset is a poset over which 
one can perform (generalised) induction. A standard example is (N, <), which corresponds to the 
usual notion of induction over the natural numbers. 

Definition 4.1.7. Let (P,<) be a well-founded poset. A function w : T(L,X) — > P is called a 
reduction ordering for a rewrite system 1Z if: 

h—>ftt 2 => <*>(h) > u>(t 2 ) 

Clearly any rewrite system that admits a reduction ordering is terminating. Termination guar- 
antees that even a naive rewriting strategy (choosing rules at random, applying until there are no 
more matchings) will terminate. If we can rewrite a term t in any number of steps to an irreducible 
term t' , t' is called a normal form of t. However, with a rewrite system that is only terminating, there 
is no guarantee that two distinct sequences of rewrites will result in the same normal form. To get 
unique normal forms, we need an additional property called confluence. 

Definition 4.1.8. A rewrite system 1Z is said to be confluent if sequences of rewrites starting with 
the same term can be rejoined. That is, for terms t, t\, ii such that t — > t\ and t — > t 2 , there exists a 
term t' such that t\ — > t' and t 2 — > t'. 



* 



V 
h 
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-> 


h 
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H> 


V 
t' 
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Theorem 4.1.9. If a rewrite system 1Z is terminating and confluent, every term t has a unique normal form 

Proof. The existence of at least one normal form is guaranteed by termination. Suppose i\ and ^ 
are normal forms for t. Then t — > t \ and t — > ti, so by confluence there exists t' such that t\ — > t' 
and ii — > t'. However, since i\ and ii are irreducible, the only possibility is that t\ = t' = ti. □ 

Terminating, confluent rewrite systems provide a decidable solution to the word problem. 

Theorem 4.1.10. For terms t\ and ti, t\ < — > ti iff t\ \. = ti I . 

Proof. (<=) follows from the definition of ( — ) 4_. For (=>), assume t\< — > ti- Then, there exists a 
finite sequence of forward and backward rewrite steps between t\ and ti. 



h <— Pi — > a\ 



-> q n -\ ■ 



-> <\n- 



h 



(4.1) 



Note that the rewrite sequence consists of "peaks" p; and "troughs" c\{. We proceed by induction 
on the number of peaks. An arbitrary rewrite sequence with n peaks, as in equation | |4.1) can be 
reduced to a rewrite system with n — \ peaks by applying confluence to q n -\ < — p„ — > q n . 



h <— Pi — > <7i 



a n-\ 



q„ 



h 



If there are zero peaks, then there exists at most one trough q, and by confluence t \ \, — q \, = ^4- 

□ 



4.2 Graph Rewriting 



Definition 4.2.1. Let Graph be the category of (directed, multi-) graphs. It is defined as the functor 
category [G, Set], for G defined as: 

s 

e ; v 



E identifies the edges of the graph, and V the vertices, s and t are functions taking an edge to its 
source and target respectively. If t(e) = v then e is called an in-edge of v and if s(e) — v then e is 
called an out-edge of v. 

For a graph G : G — $> Set, we shall write Vq for G(V), Eq for G(E), etc. and drop the sub- 
scripts when it is clear from context. Since a graph homomorphism / : G -> H is just a natural 
transformation, it is a pair of functions fy, ft such that: 

t 



Vr 



Vr 



h 



fv 



h 



fv 



~-H 



V H 



'-H 



V H 
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It is natural to ask if term rewriting (i.e. tree rewriting) techniques can be extended to arbitrary 
graphs. It turns out that most notions and techniques from term rewriting translate directly to 
graphs, but the concepts of matching and replacement become more complicated. This is because, 
unlike in the case of trees, there is no canonical "root vertex" of a graph, and hence no absolute 
notion of "position". Proceeding by analogy to the term rewriting case, suppose we wish to apply 
a graph rewrite rule L — > R to a graph G. This poses two problems: 

1 . How does one keep track of where a matching of the LHS of a graph rewrite rule has been 
made? 

2. How does one decide where to attach to the RHS of a graph rewrite rule? 

The first problem is solved by letting a matching be represented by an injective graph homo- 
morphism m : L — » G, subject to certain conditions. The second problem is solved by requiring 
that all graph rewrite rules have a common subgraph embedded into the LHS and RHS called the 
invariant subgraph of the rewrite rule. 

This invariant subgraph serves as "glue" to attach R to G after (the non-invariant part of) L has 
been removed. We can sketch out diagrammatically how this works. Suppose we have matching 
m : L — »■ G. Since m is an injection, we can think of L as a subgraph of G. Before we insert R, we 
must remove L. To do this, we might consider using the graph theoretic subtraction. 

Definition 4.2.2. For a subgraph G' C G, the graph theoretic subtraction G — G' is a new graph 
formed by removing G', as well as any edges in to or out of G', from G. 

So G consists of a component L, a component G — L, and some edges between those compo- 
nents. 











L 






G-L 





If we simply delete L from G, there is no way to keep track of how L was connected to G in the 
first place. To get around this, we treat the invariant subgraph of the rewrite rule as an interface for 
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L, and further decompose G: 



G-L 




G-L 



(4.2) 



The part of L that is not contained in I (i.e. L — I and the edges between L — I and I) is called 
the interior of L. Note that there are edges between G and I and there are edges between I and 
L — I, but there are no edges directly connecting G to L — J. We require this to be the case for any 
valid matching m of the rewrite rule. This is called the no-dangling-edges condition. Now, when we 
replace L with R, we know where the edges connected to I should go, since R also contains a copy of 
I. 





G-L 













I 








R-I 












G-L 






This procedure can be formalised elegantly using pushouts. Begin by noting that the graph G 
with the interior of I removed is the (unique) graph G' such that the following square is a pushout: 

I ' L 



r + 

- G 



G' <- 

In other words, G is the result of gluing L and G' together along J. G' is called the pushout 
complement of I ^> L — > G. Once we have G' and m' : I —} G', we can glue on R by performing a 
second pushout: 



I 



-* R 



G' 



r 



H 
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So, to complete the rewrite G — > H, we first compute a pushout complement, then compute a 
pushout. This style of graph rewriting is called the double pushout (DPO) approach. We typically 
express the whole rewrite as a single DPO diagram: 



M 


— > I < — 


" 






m' 




"1 


1 


r 


~4 


G' 


►- 



H 



Note that pushouts (and pushout complements) are defined up to isomorphism, so for G = G', 
H = H' , G — > H if and only if G' — > H'. As before, a set of graph rewrite rules 1Z is called a graph 
rewrite system, and we write G — > ■% H if there exists a rule in TZ that rewrites G to H, as above. The 



notions of confluence and termination are identical to those in Section 4.1 as well as the proofs of 



Theorems 4.1.9 and 4.1.10 for graphs, replacing term equality with graph isomorphism. 

Example 4.2.3. Let the following be a rewrite rule L <-^ I ■— > R: 

-^P O *Q C L > 





o o o 

Then, we can find a matching of L on a bigger graph G: 



o^ 




We perform the rewrite by first removing the interior of L, then gluing R to remainder of the 
graph along the common subgraph J. 




The full double pushout diagram looks like this: 
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m 




O KD 



O O O 



in 



o> 



-T 




4.3 Adhesive Categories 

In the previous section, we discussed double-pushout rewriting in the category Graph. However, 
we may also wish to do rewriting on the objects of many categories that look a bit like the cate- 
gory of graphs, such as typed graphs, graphs with extra data, Petri nets, or objects of an arbitrary 
topos. There have been two main approaches to carrying out this abstraction. The first was to 
define DPO rewriting in the context of high-level replacement (HLR) systems, introduced by Ehrig 
et al [28 J. The second approach, which we shall build on in this dissertation, relies on adhesive 
categories, introduced by Lack and Sobocihski H44I . These two notions are actually compatible, as 
was shown with the definition of adhesive HLR categories in [59 1, which categories equipped with a 
class of monomorphisms that behave "adhesively". Our construction of partial adhesive categories is 
somewhat in the spirit of adhesive HLR categories, but adhesive behaviour is localised to a certain 
class of well-behaved spans within the category rather than a certain class morphisms. 

Adhesive categories provide a general context in which rewriting on graph-like structures is 
well-defined. Their definition relies on the notion of a special kind of pushout called a van Kampen 
square. 

Definition 4.3.1. A van Kampen square (VK-square) is a pushout 



C 



D 



Such that for any commutative cube 
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where the back and left faces (ABEF and ACEG) are pullbacks, the following are equivalent: 

• the front and right faces (CDGH and BDFH) are pullbacks 

• the top face (EFGH) is a pushout 

/ S 

A pushout of a span A < — B — > C where either f or g is a monomorphism is called a pushout 

along a monomorphism. An adhesive category is a category where pushouts along monomorphisms 

are van Kampen squares. 

Definition 4.3.2. A category A is said to be adhesive if 

1. A has pushouts along monomorphisms, 

2. A has pullbacks, 

3. and pushouts along monomorphisms in A are van Kampen squares. 

At first sight, the definition of van Kampen squares can seem rather opaque, so it is useful to 
consider a concrete example. Suppose we have a set X = A U B which is composed to two (possibly 
overlapping) subsets A and B. There are two equivalent ways to define a map into X: 

1. For some set X', simply define a function/ : X' — S> X. 

2. For sets A', B', define functions /a '■ A' — > A and /g : B' — S- B such that /a and /g agree on 
A n B. I.e. for the restrictions f A \(AC\B) : I -> A n B, / B |(A n B) : I' -> An B, I = I' and 

/ A |(AnB)=/ B |(AnB). 

First, we'll see how we can obtain (2.) from (1.). Starting with a map f : X' — > X, one can simply 
let /^ and /g be the restrictions of f to the subsets A and B respectively. The restriction of a function 
to a subset of its codomain is just a pullback. We can therefore define /a and fg by this diagram: 



A' 



- X' 



B' 



L 



(4.3) 



A 



X 
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To obtain (1.) from (2.), we start with maps Ja,Jb, such that f^\ (A fl B) =/g|(An B). Again 
using pullbacks to express restriction, this means the following diagram commutes (treating I from 
above as A' H B'): 

A' - A' n B' B' 



/^ 



Let X' = A' U B' and define / as: 



h 



(4.4) 



ahb 



/(*) 



/x(s) ifxeA 
/ B (x) ifxeB 



This function is defined unambiguously, because if x is in both A and B then /a( x ) — /b( x )- 
Since X' = A' U B' is a pushout, we can equivalently define / as the induced map in the following 
diagram: 

A'HB' B' 



A' 



r 





x' 



A 



(4.5) 



X 



It is easy to show that these two procedures are the inverses of each other. If we combine 
diagrams | |4.3) , | |4.4[ , and ( |4.5) into a single diagram, we get the commutative cube from Definition 



A' 



- X' 



A'm 



Am 



A 



X 



Now, as in Definition 4.3.1 suppose we have a commutative cube where the bottom face is a 
pushout (i.e. X = A U B) and the left and back faces are pullbacks (i.e. f A \(ADB) = / B |(A n 
B)). We can now read the van Kampen square condition as follows: the front and right faces are 
pullbacks (i.e. / restricts to /a and fg along A and B respectively) if and only if the top face is a 
pushout (i.e. X' = A' U B'). 
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Examples 4.3.3. Some examples of adhesive categories: 

• Set, Graph, and Set* are adhesive categories. 

• For adhesive categories C and T> and an object X 6 C then C xT>,X/C, and C/X are adhesive. 

• For a small category X and an adhesive category C, the functor category [X, C] is adhesive. 

• Any elementary topos is adhesive. 

Note that unlike toposes, adhesive categories are stable under coslices. In particular, Set* is an 
adhesive category, but not a topos. 

Adhesive categories are useful for double-pushout rewriting because pushout complements for 
pushouts along monomorphisms are unique, when they exist. 

Definition 4.3.4. A pushout complement for a pair of arrows A — > B — > D, is an object C and a 

/ n 

pair of arrows A — > C — > D such that the following is a pushout: 




The pushout complement above should be thought of as "subtracting B from D, modulo A". If 
we think of A as the interface of B, then another way to put it is "removing the interior of B from 
D". 

Notation 4.3.5. We occasionally write B + m j C for pushout of the span B *^— A — > C and D — m ,g 
B for the pushout complement of A — > B — > D. 

We first need a few basic lemmas before showing that pushout complements in an adhesive 
category are unique. 

Lemma 4.3.6. [44] In an adhesive category: 

• monomorphisms are stable under pushout, and 

• pushouts along monomorphisms are also pullbacks. 

Proof. Let m be a monomorphism, and let the following diagram be a pushout: 

A B 

/ 
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We need to show that n is mono and the above square is also a pullback. Construct a commutative 
cube, containing two copies of the given pushout square: one on the bottom face and one on the 
right face. Place a copy of / in the upper-left corner, and fill in the rest with identities. 

1 



C 



C 



/ 



/ 



c 



D 



All of the faces of this cube commute trivially. Commutative squares involving identities often (triv- 
ially) form pullbacks or pushouts. In particular, the top face is pushout, the left face is a pullback, 
and the back face is a pullback iff m is a monomorphism. Since m is defined to be a monomorphism 
and the pushout we started with is a VK-square, adhesivity shows that the front and right faces 
must be pullbacks. Since the front face is a pullback, n is a monomorphism and thus monomor- 
phisms are stable under pushout. Furthermore, since the right face is the pushout we started with, 
we have also shown that pushouts along monomorphisms are pullbacks. □ 

We now provide a few lemmas about pullbacks and pushouts that hold in any category. 

Lemma 4.3.7. For the following commutative diagram, where the right square is a pullback: 

, f 



B 



C 



h 



] 



D 



k I 

the outer square is a pullback iff the left square is a pullback. 

Proof. Left implies outer is trivial. For the other direction, assume the outer square is a pullback. 
For an object A' let f':A'->B and h' : A' -> D be arrows such that if = kh'. 

f 



h' 



A 


►■ I 

f 


3 


C 


h 




i 




D 


1 

k 




F 

I 
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Then gf and h! form a cone under the outer pullback, so there exists a unique u : A' — »■ A 
such that gf = gfu and h! = hu. From the universal property of the right pullback, it follows that 
/' = fu, so the left square is a pullback. □ 

Lemma 4.3.8. For a commutative cube, where the front, right, and back faces are fullbacks: 

E *■ F 




the left face is also a pullback. 



Proof. Since the back and right faces are both pullback squares, we can apply Lemma 4.3.7 to show 
that the back-right two faces form a larger pullback square. Then by commutativity of the cube, 



the square formed by the left-front two faces is also a pullback square. Applying Lemma 4.3.7 from 
right to left concludes that the left face is a pullback. □ 

Lemma 4.3.9. Isomorphisms are stable under pushout. For the following pushout, if (p is an isomorphism, 
then so too is q. 




Proof. Let fcp 1 and \q form a cocone to C over the pushout. It follows straightforwardly that the 
induced map q' : D — > C is the inverse of q. □ 

We are now ready to prove the uniqueness theorem for pushout complements. 

Theorem 4.3.10. /ST . If a pair of arrows (m,g), where m is mono, has a pushout complement, it is 
unique up to isomorphism. That is, for any two pushout complements, (f,n) and (/' ,n'), there exists an 
isomorphism cp making the following diagram commute: 

f 



A 



r 




c 



-* c 



D 



(4.6) 
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Proof. Suppose both of the following are pushout squares: 



A 



+ r 

C D 

n 



r 



r 

C - D 



Following a similar strategy to Lemma 4.3.6 we use these squares to build a commutative cube: 

C" C 



\h 




i 




V 


A - 


-► A 


1 








m 


i 






■ 


\y 


\ ~ 


m 


-►- B 

^1 



c 



D 



where the first pushout square forms the bottom face, and the second pushout square forms the 
right face. The front face is the pullback of n and n', and the back face is the pullback of m with 
itself. This pullback consists of identities because m is mono. Now, f and f form a cone under the 
pullback of n and n' , so let h be the induced map. 

The right face is a pushout along a monomorphism, so by Lemma 4.3.6} it is also a pullback. We 



can therefore conclude from Lemma 4.3.8 that the left face is also a pullback. From the VK-square 



property, we can then conclude that the top face is a pushout. By Lemma 4.3. 9} k is an isomorphism 



We can also form a similar cube, but with the positions of the two pushouts interchanged: 

I 



\ h 


w l_ 


A - 


-1-+ A 


"J 




m 


w 


B 

m s 


->l 


► D 



and conclude similarly that / is an isomorphism. By construction, all faces commute, and thus by 
letting (p := A:/ -1 , we can read off the statement of the theorem from the commutative cube (with 
the relevant arrows shown in bold). □ 

As in the case of graphs, a rewrite rule in an adhesive category A is a span of monomorphisms. 

L ^ >R:=L< ^ I J^ R 
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DPO rewriting consists of three steps. 

1. Identify a matching m : L — » G. 

2. Compute the pushout complement G' of L in G. 

3. Push out G' and R to obtain a rewritten graph H. 

First, we define matchings. Adhesive categories only guarantee the uniqueness of pushout com- 
plements, not the existence. In most categories, these will not exist for arbitrary pairs of morphisms. 
Thus, we build the existence condition into the definition of matching. 

Definition 4.3.11. For a rewrite rule 

L^R:=L^-I-^R 

a matching m of L — > R on G is a monomorphism m : L — > G such that the following pushout 
complement exists: 

h 
I - L 



+ r + 

G' - G 

This existential condition is not particularly useful in determining which morphisms are match- 
ings. Luckily, in specific adhesive categories, we can often do better. For graph-like categories, we 
can usually ensure the existence of a pushout complement using some version of the no-dangling- 
edges condition. 

Definition 4.3.12. A monomorphism m : L — > G of a rewrite rule L < — I — > R in the category 
Graph is said to satisfy the no-dangling-edges condition if for any vertex c £ V^ — V\, all edges 
incident to m(v) must be in the image of m. 

Theorem 4.3.13. In Graph, a monomorphism m is a matching if and only if it satisfies the no-dangling- 
edges condition. 

Proof. Since I — > L — > G are both monos, we will assume without loss of generality that I C 
L C G. For (<^), we form the pushout complement G' as a graph with vertices Vq — (Vi — Vj) and 
edges Eq — (Ei — £/). By the no-dangling-edges condition, if s(e) £ Vj, — Vj or t(e) £ V^ — Vj then 
e 6 Ei. If e were to be in Ej then s(e) £ Vj and t(e) E Vj so e E Vi — Vj. Therefore, the maps s and t 
have well-defined restrictions to G'. Furthermore, G'UL = G and G' ("I L = I, so G' is the pushout 
complement. 

For (=>), suppose m is a matching. Then there exists a pushout square: 
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+ r + 

G' * G 

*i 

Since monos in an adhesive category are stable over pushout, b\ and m' are also monos. As 

such, we identify I with its image under m' regard GasG'U L. Let v be a vertex in V^ — Vj, and e 

an edge incident to v in G. Suppose e is not in L (i.e. the image of m : L — »■ G), then e must be in G'. 

Thus c must also be in G', so c 6 Vqi DVi = Vj, which is a contradiction. We therefore conclude 

that e &E L . a 

We can now define the notion of a rewrite in an adhesive category. 

Definition 4.3.14. For a rewrite rule L — > R and a matching m : L — >■ G, the rewrite of G into H is a 
double pushout diagram: 

L 7 ► K 



in + r ^ 

G G' H 



4.4 Partial Adhesive Categories 

Adamek sums up a procedure by which many categories are defined in Abstract and Concrete Cate- 
gories H: 

Many familiar constructs of an "algebraic" or "topological" nature have natural de- 
scriptions that can be accomplished in two steps. The first step [...] consists of defining 
algebraic (resp. topological) categories by means of certain functors. The second step 
consists of singling out full, concrete subcategories by imposing certain axioms on the 
objects. 

We shall follow this prescription to construct the category of string graphs in the next chapter. 



We already saw in Examples 4.3.3 how adhesivity is inherited by categories defined "by means 
of certain functors": namely slice, coslice, and functor categories. There is no reason for a full 
subcategory of an adhesive category to also be adhesive. However, we can still characterise certain 
subcategories of an adhesive category that inherit "enough adhesiveness" to do rewriting. 

Definition 4.4.1 (Partial Adhesive Category). C is called a partial adhesive category if it is a full sub- 
category of an adhesive category A and the embedding functor S : C — » A preserves monomor- 
phisms. 
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The category C inherits the unique pushout complement property for a certain class of pushouts 
in C, which we shall call S-pushouts. 

Definition 4.4.2 (S-spans and S-pushouts). Let C be a partial adhesive category and S : C — > A the 
embedding functor. A span A < — B — > C in C is called an S-span if it has a pushout and that 
pushout is preserved by S. Such pushouts are called S-pushouts. 

Recall that full and faithful functors reflect colimits. Since S reflects all pushouts, we could 
equivalently define S-spans in C as those spans which have a pushout reflected by S. 

Definition 4.4.3 (S-pushout complement). An S-pushout complement for a pair of arrows (b,f) is a 
pushout complement, where the following diagram is an S-pushout: 

b 



I 



G' 



r 



/ 



G 



We call b the boundary of L and c the coboundary of L in G. 

Lemma 4.4.4. If a pair of arrows (b,f), where b is mono, have an S-pushout complement, it is unique up to 
isomorphism. 

Proof. Let (c,g) and (c',g') be S-pushout complements. Then the following diagrams are pushouts 
in the adhesive category A: 



SB - 


^SL 


SB - 


-^. 


c 






Sf 


Sc' 






s 


S( 


■ 
-i 


r 
s 


G 


1 

S( 


■ 


r ' 
s 


G 



Sf 



$s Sg> 

Since S preserves monos, these are both pushout complements of (Sb,Sf) for Sb mono. So the 
following diagram commutes in A, for cp' an isomorphism: 

Sc 




Since S is full and faithful, there exists an isomorphism <p : G' — » G" such that S<p = <p' . Replac- 
ing <p' in the above diagram yields: 
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SG" — -^ SG 



Diagram 1 4.6 1 commutes by the faithfulness of S. 



a 



Definition 4.4.5 (S-matching). For a rewrite rule L — > R, a monomorphism m : L — >■ G is called an 
S-matching if B — i L — > G has an S-pushout complement. 

Clearly if m is an S-matching, then Sm is a matching. For the converse to be true, it suffices for 
the image of S to be closed under subobjects. 

h h 

Definition 4.4.6 (S-rewrite). Let L — > R := L < — I — > R be a rewrite rule and m : L — >■ G be an 
S-matching. Then for G' the S-pushout complement of B — > L — > G, if the right pushout square 
in the following diagram exists and is an S-pushout: 



R 



~l 



G' 



r 



H 



Then H is the S-rewrite of L — > R at m. 
We write H defined as in 



4.4.6 



as G[L — > R] m , or more explicitly G[L <— — I — h- R] m . 



4.4.1 Example: The Category of Simple Graphs 

Partial adhesive categories should be thought of as adhesive categories, with some extra axioms 
imposed on the objects. In the presence of these axioms, one may need to verify by hand the 
relevant S-pushouts and S-pushout complements exist for a particular class of rewrite rules or 
matchings. In practice, this tends to be fairly straightforward. In this section, we give the derivation 
of these properties for the category of simple graphs. 

Let Gr be the category of simple graphs, i.e. graphs where every pair of vertices is connected by 
at most one edge in either direction. Equivalently, a simple graph is just a binary relation from a set 
to itself. An object in Gr consists of a set V of vertices, E of edges and an injection e : E e -> V x V. 
A simple graph homomorphism is a pair /V,/e such that 



h 



t 



V c xV c 

fv xfv 
V H xV H 
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There is an evident embedding of Gr into Graph: 

S : (V G ,E G ,e : E -> V) \-+ (V G , E G , n 1 o e, n 2 o e) 

Under the identifications s = K\ o e, t = 712 o e, the notions of graph homomorphism in Gr and 
Graph are equivalent, so S is a full subcategory embedding. Gr is a reflective subcategory of Graph, 
so S has a left adjoint. As a right adjoint, S preserves limits and, in particular, monomorphisms. 
Therefore Gr is a partial adhesive category. 

Lemma 4.4.7. Let A < — B — > Cbea span in Gr, where m is a mono and n is a regular mono. Then m, n 
has an S-pushout. 

Proof. Since S is full and faithful, it suffices to show for m a mono, n a regular mono, that D defined 
by the following pushout in Graph is a simple graph. 



s 


A - 


Sm 

►• 5 


B 


Sn 










■ 


r + 


S 


C - 


1 


) 



We can consider D to be a union of a simple graph SB and another simple graph SC. Regular 
monos in Gr are precisely the full subgraph embeddings, so SA = SB D SC is a full subgraph of 
SC. Consider two vertices v, v' in D and edges e, e' such that s(e) = s(e') = v and t{e) = t(e') = v'. 
The only way these edges can possibly be distinct is if e is in SB and e' is in SC. Thus v and v' are 
in SB n SC. Since the intersection is a full subgraph of SC, e' is in SB n SC, so e = e' . □ 

We can an S-matching that is a regular a monomorphism a regular S-matching. We can show that 
when m is a regular S-matching in Gr then the associated S-rewrite exists and is unique. 

Theorem 4.4.8. For a rewrite rule L — > R := L < — I — > R and a regular S-matching m : L — >■ G, the 
associated S-rewrite is well-defined. That is, the following two S-pushout squares exist: 

L B ► R 

m n (4.7) 

In I r " 

G G' " H 

Proof. The existence of the left S-pushout square follows from the fact that any subgraph of a simple 
graph is also a simple graph. Since the following is a pushout along a monomorphism in Graph, it 



is also a pullback, by Lemma 4.3.6 
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Sbi 

SI SL 



Sn 
SG' 



Sm 



Sf 



r + 

- SG 



Since S is full and faithful, it reflects pullbacks. So the following is a pullback in Gr. 



J 



G' 



/ 



Regular monos are stable under pullback, so n is a regular mono. By Lemma 4.4.7 the right 
square in ||4.7) is an S-pushout. D 



Thus, if we restrict to regular monomorphisms for marchings, DPO rewriting in the partial 
adhesive category Gr is well-defined, and the procedure is identical to that in the category Graph. 

4.4.2 Commutation of S-Pushouts and S-Pushout Complements 

Often we wish to glue objects together using S-pushouts. If we perform an S-rewrite that is con- 
fined to a single component of this glued-together object, it should not matter if we first apply the 
rewrite then compose or if we first compose then apply the rewrite. In later sections, we shall de- 
fine categories whose morphisms consist of graphs modulo a rewrite system. In such categories, 
this commutation of gluing and rewriting is crucial to ensuring that the composition operation 
well-defined. Therefore, we now prove two lemmas regarding the compatibility of S-pushouts, 
S-pushout complements, and S-rewrites. 

Lemma 4.4.9. S-pushout complements commute with S-pushouts. Let the following diagram be an S- 
pushout: 

i 

P H 



r 



b in b 'i ffi 

Assume B — > K — > G and B — > K — > G + P/(? H both have pushout complements, 



K 



K 



G — b,m K s 



r + 

— G 



(G +p t q H) —}, t i im K 



i\m 



r 

G +p,q H 
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and the S-span (p,q) factors through G —b, m K, i.e. there exists p' such that sp' = p and (p',q) is an 
S-span. 

Then, for a second S-pushout: 

1 



H 



r 



]2 



G — : — ► (G - bittl K) +„,„ H 
h 



there is an isomorphism <p : (G +p /(? H) — ^ m K —> (G — j, „, K) + v ',q H, commuting with the coboundaries 
c and c' ofK in G and G +p /(? H respectively. 



b,m 



K 



h 



(G + P/1? H) —\, t i xm K 

<P 
(G —b, m K) +p' /(? H 



(4.8) 



Proof. The proof follows from the associativity of pushouts and the uniqueness of pushout comple- 
ments. First, note that, in the following diagram, [1] commutes and is a pushout because sp' — p: 

1 
P - H 



b.m 



K [1] 



r + 



K 



Next, we make the two pushouts in the opposite order. 

"7 



'2 



r 

G +p,q H 



H 



r 
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b,m 



K 



[2] 



(G -b,m K) +pi t q H 
k 2 



r 



x 



Q 



By associativity of pushouts, there exists an isomorphism ip such that: 
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G +p /(? H 



H 



i\m 



k 2 j 2 



K 



Q 



Since Q is only defined up to isomorphism, we are free to take ip — 1q+ h> m which case 
k\ — i\m. Then, square [2] from above becomes: 



B 



K 



i\jn 



(G —\,,m K) +pi i q H 
k 2 

r " 

G+ p , q H 



Compare this to the definition of (G + P/ q H) —\)U m K as a pushout complement: 



(G + Piq H) — jy im K 



t 
K 



i\m 



r 

G +p,q H 



The result then follows from uniqueness of pushout complements. 



n 



Theorem 4.4.10. S-adhesive rewrites commute with S-adhesivepushouts. Let m : L — >■ Gbean S-matching 
ofL — > R:= L i — I — ^ R. The rewrite is computed as the double pushout: 

b\ b 2 
L - B R 



~l 



r 



G 



G-b, m L ;-— G[L —>R], 



Let (p,q), (p,cj) and (p',q) be three S-spans, such that: 



G -b v m L " P H 



G[L^_R], 




(4.9) 



Then, for the pushout injection i\ : G — > G + P/1? H, jf /jm z's mono, the following is an isomorphism: 

(G[L^> R] m ) + p , iq H^(G + M H)M R] hm 
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Proof. G[L — > R] m is the pushout of R and G — \, m L along B, so by uniqueness of pushout comple- 
ments, we can choose (G —&,,»! ^) to ^e equal to ({G[L — > R] m ) —b 2 ,m' ^)/ f° r me same coboundary 
c. Then, by two applications of Lemma 4.4.9 we can choose (G — j, w L) +g „ H = ( (G [L 



R] 



hz,m' 



R) +$q H as the pushout complement of both of the following squares. 
L B R 



i\m 



G +p,q H 



b\,m 



i)+M H 



G[L- 



R] 



V A 



H 



Note that c' becomes the coboundary for both squares because diagram 1 4.8 1 commutes. This 
is then exactly the computation of the rewrite (G +p,q H)[L — > R]i im . The theorem holds because 
S-rewrites are unique up to isomorphism. D 

We shall use these two theorems throughout this dissertation to show that rewriting is compat- 
ible with several notions of composing graphs. 
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Chapter 5 

String Graphs and Monoidal Theories 



In this chapter, we come to one of the primary contributions of this dissertation: the formalisation of 
the diagrammatic language of monoidal categories using certain typed graphs called string graphs. 
In the previous chapter, we introduced the notion of a partial adhesive category. Our primary 
reason for doing so was to enable double-pushout graph rewriting in the category of string graphs. 
Before passing to string graphs, we will look at the category of typed graphs, defined as a slice 
category Graph/Gj-. Objects of the slice category are pairs (G, Tq : G —> Gj). These should be 
thought of as a graph G along with a map Tq giving a type in Gj to every vertex and edge in G 
Morphisms are simply graph homomorphisms respecting this type map: f : G — » H such that 
Tn o f = Tq. It may seem odd at first that the type map is a graph homomorphism rather than 
simply a pair of functions defined on vertices and edges. However, the first can encode the latter, 
so these "unresticted" vertex and edge typings are merely a special case of homomorphic graph 
typings. As an example, suppose we fix a set X of vertex types and have only one edge type. Then 
Gj can be defined as the connected graph whose vertices are the elements of X. For instance, if we 
let X = {black, white, grey}, we can form a connected graph: 



black 



white 



grey 



Since Gj is isomorphic to the totally connected directed graph with 3 vertices, any function 
Ty : Vq — > Vq t extends uniquely to a graph homomorphism t : G — » Gj. We can think of the 
fibres of Ty (i.e. the inverse images Ty (black), Ty (white), and Ty (grey)) as sets of vertices in G 
that are coloured "black", "white", or "grey" respectively. We can represent a graph with coloured 
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vertices as a pair consisting of a graph G and a typing function t. 





\ 



V\ \-> white, V2 h-> grey,l 
11 B3H black, va, i-> grey J 



/ 

Typegraphs can express edge typing as well. For instance, to add a set of edge types Y = 
{+, —}, we can simply add a copy of Y for every pair of vertices (v\,V2) in Gj connecting ci to vi- 



\j*t° '.- 




One can even restrict which vertex types can be connected by which edge types by taking sub- 
graphs of Gj. In the coming sections, we will use this restriction to make sure that our diagrams 
are well-typed, i.e. composition of "boxes" in the language of string diagrams should respect the 
types on "wires". 

5.1 String Graphs 

String diagrams consist of boxes, which represent morphisms in a monoidal category, and wires, 
which are used to connect boxes together. We turn string diagrams into a string graphs, which 
are typed graphs with two distinguished kinds of vertex: box-vertices and wire-vertices. As the 
name suggests, box-vertices represent the boxes (i.e. morphisms /generators) in the diagrammatic 
language. These represent the "logical" or "semantic" vertices of the string graph. An important 
characteristic of wires, which distinguishes them from normal edges in a graph, is that they are not 
required to have boxes at both ends and they can be connected to themselves to form circles. For 
that reason, we will represent wires as chains of special "place-holder" vertices called wire-vertices. 




Note how the wire-vertices carry the type of the wire. Representing boxes as box-vertices, it is 
possible to translate any string diagram into a string graph. 
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\-> 




Edge types (not shown) are used to keep track of the ordering of inputs and outputs to the boxes 
(see Definition 5.1.1} . 

The number of wire-vertices making up a wire is irrelevant, so for the purposes of representing 
a string diagram, the following two string graphs are equivalent: 




r^j 





If we were to treat the two graphs above as ID simplicial complexes that define topological 
graphs, then the geometric realisation of the wires on the left are homeomorphic to those of the 
wires on the right. For that reason, the two graphs above are called wire-homeomorphic. We can for- 
malise the notion of wire-homeomorphism as a confluent, terminating graph rewrite system and 
prove that string graphs, up to wire-homeomorphism can be used to construct free monoidal cate- 
gories. This construction is essentially the graph version of the topological construction described 
in Section|232] 

For a monoidal signature T, we can define a category SGraph T of string graphs with generators 
taken from T. We do this by turning the monoidal signature T into a typegraph Gj and defining 
SGraph T is a full subcategory of Graph/ Gj. We shall then prove that the embedding of SGraph T 
into Graph/ T preserves monos, so SGraph T is a partial adhesive category. 
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First, we show the construction of Gj. Recall that a monoidal signature T = (O, M,dom,cod) 
consists of sets O, M and functions dom : M — > w{0), cod : M — s> w(O) into the set w(0) of lists of 
elements of O. 

Definition 5.1.1. For a monoidal signature T, the derived typegraph Gj of T has vertices O + M and 
the following edges: 

• a self -loop midx for every X € O, 

• an edge in^ for / E M, < i < Length(dom(/) ), connecting dom(/) [i] to /, and 

• an edge out f,- for/ eM,0<j'< Length (cod (/)), connecting / to cod (/)[/]. 
Example 5.1.2. Let T be the following monoidal signature: 



T:~- 



A B 

I I 


C 

I 


/ 


/ 


g 


4- 
C 


4. 
C 



Then, the derived typegraph Gj is: 




Definition 5.1.3. Let (G, T : G — > Gy) be an object in Graph/ Gj, for a monoidal signature T. By 
definition of Gj, the vertices of Gj are O + M. A vertex p G Vq is called a box-vertex of ty (o) 6 M. 
It is called a wire-vertex if Xy{v) 6 O. Let B(G) C V G be the set of all box-vertices, and W(G) be the 
set of wire-vertices. 

Note that since we have omitted self-loops on box-vertices in Gj, any path between two box- 
vertices in a Gj-typed graph must pass through a wire-vertex. This is important to the definition of 
string graphs, as the "object" types from the signature T are carried by wire-vertices. 

There are two restrictions that we place on typed graphs (G, t) to make string graphs. The first 
is that wires should not split or merge. Namely, any wire-vertex in G should have at most one 
in-edge and one out-edge. In other words, for a graph G = (Vq, Eq,s, t), the restrictions of s and 
t to the wire vertices W(G) are both monomorphisms, i.e. s', t' defined by the pullbacks below are 
monomorphisms: 



E' > W{G) — E" 

J r L 

III 
Eg g — - V G « — E G 

The second condition is that the incident edges of a box-vertex b in G should match those of its 
type, Ty(b). In other words, there should be the same number of inputs and outputs to b as there 
are to ty (b). We formalise this condition by introducing the notion of a local isomorphism. 

Definition 5.1.4. For a vertex v £ Vq, the edge neighbourhood of v is the set of edges N(v) := 

s- 1 (^)ur 1 (^). 

Fix a graph homomorphism f : G —t H. Then for a vertex v in G and an adjacent edge e, the 
edge/(e) is adjacent to f(v). Thus f E (N(v)) C N(f(v)). Let f v : N(v) ->■ N(f(v)) be the function 
defined by / D (e) = /^(e), for e 6 N(v). 

Definition 5.1.5. For Gr-typed graphs (G, t g ), (H, Th), a typed graph homomorphism f : G —t H 
is called a /oca/ isomorphism if for every box fr 6 B(G), / fc : N(b) — >■ N(f(b)) is a bijection. 

In particular for a Gj-typed graph (G, t), the typing map T can be considered as a typed graph 
homomorphism from (G, t) to (Gj, 1g t )- Thus, we can require that it be a local isomorphism. 

Definition 5.1.6. A Gj-typed graph (G, t) is called a string graph if t is a local isomorphism and 
every wire-vertex in G has at most one in-edge and one out-edge. The category SGraph T is the full 
subcategory of Graph/ T whose objects are string graphs. 

Since the typing maps T in SGraph T are local isomorphisms, we can show that every arrow in 
SGraph T is a local isomorphism. 

Proposition 5.1.7. Every arrow in SGraph T is a local isomorphism. 

Proof. Let (G, Xq), (H,t h ) be Gj-typed graphs. By definition t g and r H are both local isomor- 
phisms. For any f : (G, Tg) — >■ (H, Th) in Graph/Gj, the following diagram commutes: 

G ►• Gj 




f 
H 

Thus, for any v in G we get this triangle in Set: 
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N(t G (v)) 



N(f(v)) 

Since Xq and t^' are both bijections, f is a bijection, so / is a local isomorphism. □ 

Proposition 5.1.8. A morphism in SGraph T is a monomorphism iff it is injective. 

Proof. Suppose w : G -fHin SGraph T is not injective. m is a local isomorphism, so if m takes 
two distinct edges e\ and ei to the same edge, then the adjacent vertices of e\ and e-i must also be 
distinct. 

To show that m cannot be mono, we will define a string graph K and distinct maps f,g:K—> G 
such that mf = mg. If m takes two distinct box-vertices V\, Vi in G to a single box-vertex in H, then 
let K be the subgraph of G consisting of just v\ and its neighbourhood. If m takes two distinct wire- 
vertices to a single wire-vertex in H, then let K be a T-string graph consisting of a single, isolated 
wire-vertex. In either case, there are two distinct maps f, g such that mf = mg. □ 

Corollary 5.1.9. SGraph r is a partial adhesive category. 

5.1.1 S-pushouts of String Graphs 

For the constructions to come, it is useful to characterise the S-pushouts in SGraph T . To simplify 
matters, it suffices to characterise the S-pushouts of spans G < — K — > H where m and n are both 
monomorphisms. We show that a span of monos is an S-span if and only if it is boundary-coherent. 
Before we define boundary-coherence, we need the notion of a boundary. 

Definitions 5.1.10. If a wire-vertex has no in-edges, it is called an input. We write the set of inputs 
of a string graph G as In(G). Similarly, a wire-vertex with no out-edges is called an output, and the 
set of outputs is written Out(G). The inputs and outputs define a string graph's boundary, written 
Bound (G) . If a boundary vertex has no in-edges and no out-edges, (it is both and input and output) 
it is called an isolated wire-vertex. An string graph consisting of only isolated wire-vertices is called 
a point graph. 

By abuse of notation, we may treat In(G), Out(G), and Bound(G) as sets or as point graphs. 
The intended usage will be clear from context. 

Definition 5.1.11. A pair of morphisms / : K — > G, g : K ^ H in SGraph T is called boundary- 
coherent if: 

1. for all v 6 In(K) at least one of f(v) and g(v) is an input, and 
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2. for all v 6 Out(K) at least one of /(c) and g(v) is an output. 

Theorem 5.1.12. A span of monomorphisms G < — I — > H in SGraph T has an S-pushout if and only if 
m and n are boundary-coherent. 

Proof. For (<=), it suffices to show that K is a string graph, for the following pushout in Graph/Gj-: 

SI^^SG 



Sn 



g (5.1) 



r 

SH ► K 

f 

Since X is isomorphic to the union of SG and SH, the images of /and g cover K. Let/(fc) 6 B(K) 
be a box-vertex in the image of / but not g. Then the neighbourhood of b is identical to that of f(b), 
so the inclusion of edges f h : N(b) — > N(f(b)) is a bijection. Similarly, for g(b) £ B(K) not in 
the image of /, g : N(b) — > N(g(b)) is a bijection. Finally, pick a box-vertex b 6 B(I), then since 
m and n are local isomorphisms, Sm and Sn restrict to bijections on N(b) C Ej. Thus, all of the 
edges in N(Sm(b)) are identified with edges in N(Sn(b)) in K, and the inclusions / and g restrict 
to bijections on N(Sm(b)) and N(Sn(b)) respectively. So / and g are local isomorphisms. Since the 
images of / and g cover K, it follows that the typing map Tr is a local isomorphism (cf . the proof of 
Proposition 5.1.7). 



Now, suppose a wire-vertex v 6 W(K) has out-edges e%, e-i- The only way these can possibly 
be distinct is if v is in the image of both / and g. Then, there must be a v' such that f o Sn(v ) = 
g o Sm(v') = v. If v' is not an output in I, then the out-edges of m(v') 6 B(G) and n(v') 6 B(H) 
must be in the images of m and n, respectively, so e\ = ei- If it is an output, then at least one of 
m(v'), n(v') must be an output, so v has at most one out-edge. We can show similarly that v must 
have at most one in-edge. So K is a string graph. 

For (=>), suppose the span G < — I — > H is not boundary-coherent. If the pushout K given 
by < |5.1[ is not a string graph, then either (a) the span m, n does not have a pushout or (b) m, n does 
have a pushout, but it is not preserved by S. In either case, the span does not have an S-pushout, 
so it suffices to show that K is not a string graph. If the span m, n is not boundary-coherent, there 
exists wire-vertex v in I such that either (a) v is an input and f(v) and g{v) both have in-edges or 
(b) V is an output and f(v) and g(v) both have out-edges. If (a) is true, then the image of V will 
have two distinct in-edges in K. If (b) is true, it will have two distinct out-edges. In either case, K is 
not a string graph. □ 

Example 5.1.13. Consider the following span of string graphs, which is not boundary-coherent: 
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j 



. V2 * V2 Xvz 

*v 3 lv 3 tv 3 

* V4. iv' 4 

If we push out this span in Graph/ Gj, we get: 

r 
a: 

which is clearly not a string graph. Therefore this span does not have an S-pushout. 

Note that the empty graph is the initial object in SGraph T , so we have an easy corollary. 
Corollary 5.1.14. SGraph T has finite coproducts and S preserves them. 

Proof. The proof follows from the observation that for any string graphs G, H, the span of initial 

1 1 

arrows G <— — — > H is trivially boundary-coherent. □ 

One particularly important type of boundary-coherent span is a plugging. These are used to 
"plug" the inputs of one graph into the outputs of another graph. 

Definition 5.1.15. A boundary-coherent span G < — P — > H where P is a point graph and G and 
H contain no isolated vertices is called a plugging. 

Recall that wire-vertices in a point graph are both inputs and outputs. Thus boundary-coherence 
forces the images m(p) and n(p) of a wire-vertex p 6 P to have opposite polarities. That is to say 
exactly one of m(p),n(p) is an input and exactly one is an output. 

Example 5.1.16. The following S-span defines a plugging: 



Pushing out the span yields a string graph with the two smaller graphs plugged together: 
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5.2 Rewriting with String Graphs 

If a string graph contains no isolated points, then the set of inputs and outputs is disjoint, i.e. 
Bound (G) = In(G) + Out(G). We will define string graph rewrite rules as pairs of string graphs 
with no isolated wire-vertices whose inputs and outputs are in bijection. Such a pair uniquely 
induces a span L A- I ■% R for I S In(L) + Out(L) S ln(R) + Out(R). 

h b 

Definition 5.2.1. A string graph rewrite rule L — > R is a span L < — I — — >• R where: 

1 . L and R contain no isolated wire-vertices, 

2. In(L) S* In(K), Out(L) S Out(R), 

3. I = In(L) + Out(L) S In(K) + Out(R), and 

4. the following diagram commutes for b\ and \>i the induced maps of the coproduct inclusions 
i,j and i',f respectively: 



L * 



In(L) 



Out(L) 



MR) 




Out(R) 



Since L contains no isolated wire-vertices, the images of In(L) and Out(L) are disjoint, so b\ is 
injective. For the same reason, \>i is also injective. 

Example 5.2.2. Let / : A — > A ® A and g : A <g> A — > A be morphisms in a monoidal category such 
that fog = \ A . This can be expressed as an equation between string diagrams: 



...or as a rewrite rule between string graphs: 
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h h 

Theorem 5.2.3. Let L — > R := L < — I — ^ R be a string graph rewrite rule. Any monomorphism 
m : L — >■ G is an S-matching. 

Proof. Since m is a local isomorphism and l?i covers the boundary of L, m satisfies the no-dangling- 
edges condition, so it has a pushout complement G' in Graph/ Gj- For m to be an S-matching, it 
suffices to show that G' is a string graph. I consists only of wire-vertices, so by the no-dangling- 
edges condition, the adjacent edges of the box-vertices in G' are the same as they were in G, so the 
typing map Tqi is still a local isomorphism. The fact that G' is a string graph then follows from it 
being a subgraph of G. □ 

5.2.1 Wires and Wire-Homeomorphism 

String graphs are meant to be the discrete version of (topological) string diagrams. In string dia- 
grams, wires can be thought of as copies of the unit interval [0, 1] C M, considered as an oriented 
manifold. Boxes are distinguished points, to which we ascribe semantic meaning. Clearly if we 
replace a wire in a string diagram with a homeomorphic wire, we get the same string diagram. In 
other words, the meaning of a string diagram is unaffected by shortening or lengthening wires. 

A simple chain is a connected, acyclic graph where each vertex has at most one in-edge and one 
out-edge. A vertex in a chain with only an in-edge or only an out-edge is called an endpoint. A 
simple cycle is connected graph where each vertex has exactly one in-edge and one out-edge. 

Definition 5.2.4. For a string graph G, a closed wire W C G is a simple cycle of wire-points or a 
simple chain such that 

1 . the endpoints of W are either box-vertices or in the boundary of G, and 

2. all other vertices in W are wire-vertices. 

It is worth noting that wires are not necessarily string graphs, as their typing function need not 
be a local isomorphism at the endpoints. 

Example 5.2.5. A string graph and its 4 closed wires: 




Every wire-vertex in a wire will be of a single type in Gj. This is called the wire type. 
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Definition 5.2.6. Two wires W and W' are said to be homeomorphic if they have the same wire type 
and (a) they are both simple cycles, or (b) they are simple chains where the endpoints and edges 
adjacent to the endpoints are of the same type. 

Informally we justify this terminology by noting that, if we consider W and ffl as simplicial 
complexes, then for homeomorphic wires, the geometric realisations of the two complexes as topo- 
logical spaces (i.e. as a circle or the unit interval) are homeomorphic. 

Definition 5.2.7. Two string graphs G and G' are called wire-homeomorphic if G' can be obtained 
from G by replacing any number of closed wires W with homeomorphic wires W'. 

For any string graph G, there is a unique smallest wire-homeomorphic graph G J, obtained by 
replacing every wire in G with a homeomorpic wire containing a single wire-vertex. We justify the 
notation G\. by formalising wire-homeomorphism using a rewrite system H. 

Definition 5.2.8. For a monoidal signature T = (O, M, dom, cod), the rewrite system H is defined 
as follows. For every X £ O, we define a loop contraction rule h 1 ^ and a wire contraction rule h^: 



a 

l x 



■AV 



For every/ 6 M and < z < Length (dom (/)), < j < Length(cod(/)), we define an input 
contraction rule hi • and an output contraction rule h.9 •: 






Proposition 5.2.9. Two string graphs G, H are wire-homeomorphic if and only if G < — > h H. 

Proof. For (<?=), we observe the all of the rules in H leave the endpoints of a wire fixed, while 
decreasing the number of other wire-points. For (=>), it suffices to show that H lets us increase or 
decrease the number of wire-vertices in any wire in G. Consider the types of wire W that can occur 
in G: 

1. a simple cycle, 

2. a chain starting with a boundary and ending with a boundary, 

3. a chain ending with a box-vertex, or 

4. a chain starting with a box-vertex. 
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These alternatives are not mutually exclusive, but they are exhaustive. In the case of (1.) the 
rules hh or hy can always decrease the size of W if applied forwards, and increase the size of W if 
applied backwards. In the case of (2.) apply h™, for (3.) apply hi •, and for (4.) apply h9 ,. D 

Lemma 5.2.10. The rewrite system H is confluent (up to graph isomorphism) and terminating. 

Proof. Termination comes from observing that each contraction rule strictly decreases the total 
number of wire-vertices. Confluence follows from noting that any forward-directed rewrite proce- 
dure starting with G terminates at the unique minimal wire-homeomorphic graph G\.. □ 



Example 5.2.11. Normalising a string graph with respect to H: 

* * y * 



7/ u ' 
"x 



h° 
ffi 



7,1 




Note that there are multiple ways this string graph could be normalised, but since H is confluent, 
the end result will always be the same. 

5.3 Cospans over the Category of String Graphs 

For any category C with pushouts, we can form its cospan bicategory Csp(C). The 0-cells of Csp(C) 

/ ? 

are the objects from C, the 1-cells are cospans X — > F <. — Y, and the 2-cells are cospan homo- 

f S f ?' 

morphisms. A cospan homomorphism from X — > F i — Y to X — > F ^ — Y is a morphism 

a : F — > F' in C that commutes with the cospan maps. 



X 



/' 



F' 



Y 



f S 

2-cell composition is the usual composition of morphisms in C. For two cospans X — > F < — Y 



and Y A G 4- 



Z, the composition is formed by pushing out over Y. 
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X 




The composed cospan is then X — > G o F < — Z. Defining Idx := X, identity 1 -cells are 
cospans of identity maps: X — > Idx < — X. It follows from general properties of pushouts that 
the following are cospan isomorphisms: 

(HoG)oF^Ho(GoF) Id Y oG = G = GoId x 

We can form the ordinary category csp(C) by taking objects to be the 0-cells from Csp(C) and 
arrows to be isomorphism-classes of 1-cells in Csp(C). 

Sobocihski and Sassone [64. 61 1 have extensively studied rewrite systems in the context of bicat- 
egories, and in particular bicategories of cospans over adhesive categories. In this section, we will 



focus particular types of cospan constructions over string graphs that will be used in sections 5.4 



and 5.5 to construct free monoidal categories and free monoidal categories containing algebraic 



structures. 

Recall that for string graphs, certain S-pushouts called pluggings perform the function of com- 

d c 

position. Using cospans of string graphs X — > G < — Y where X and Y are point graphs, we 
can "pin" the inputs and outputs for a particular graph in place (i.e. distinguish domain from 
codomain and induce a total order) and allow us to define composition unambiguously. To ensure 
that the cospans we consider are meaningful in terms of morphisms in monoidal categories and 
every cospan composition is a plugging, we shall restrict our attention to framed point graphs and 
framed cospans in SGraph r . 

Definition 5.3.1. A framed point graph is a triple (X, <,sgn) where X is a point graph, < is a total 

d c 

order on Vx, and sgn : Vx — >■ {+, — } is called a signing map. A cospan X — > G < — Y is called a 
framed cospan if: 

1 . X and Y are framed point graphs, 

2. G contains no isolated wire-vertices, 

3. the induced map [d,c] : X + Y ^ G restricts to an isomorphism [d, c]' : X + Y ^> Bound (G), 

4. for every v € Vx, d(v) 6 In(G) <=> sgn(i?) = +, and 
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5. for every v E V Y , c(v) E Out(G) <=> sgn(c) = +. 

The function sgn assigns a polarity to each element of the boundary. A positive polarity marks 
a wire that runs in the usual (downward) direction, whereas a negative polarity marks a wire that 
runs in the dual (upward) direction. 

Notation 5.3.2. For a framed point graph X, let X* be the same framed point graph with the signs 
reversed. 

A totally downward-directed framed cospan is called positive. 

Definition 5.3.3. A framed point graph X is called positive if sgn(v) = + for all v E Vx- A framed 

d c 

cospan X — > G < — Y is called positive if both X and Y are positive. 



Proposition 5.3.4. For two framed cospans: 

d c 

c d' 

The span G < — Y — > H is a 



Y 



r> 

n^—z 



Proof. For each v E Vy, if sgn(z?) = + then c(v) E Out(G) and d{v) E In(H). If sgn(z?) = — , then 

c d' 

c(v) E rn(G) and d(v) E Out(H). Thus G < — Y — > H is boundary-coherent. Y is a point graph, 
so the span is a plugging. □ 

As a consequence, for framed cospans G, H, the composition H o G exists and is computed by 
S-pushout. 

Example 5.3.5. Composing framed cospans by plugging: 



H 





}HoG 



i 1 



1 1 

+mC +»D 

Definition 5.3.6. For a framed point graph (X, <,sgn), the pseudo-identity cospan 

x -A i x <A- x 
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s(e v ) 



is constructed as follows. Wx has vertices Vx + Vx- d ■ X — » lx maps the vertices of X into the first 
copy and c : X — >• lx maps them into the second copy. E% x = {e v : v 6 Vx} an d: 

d(v) if sgn(c) = + 
c(v) if sgn(c) = — 

c(v) if sgn(c) = + 
d(v) if sgn(c) = — 

If we try to form the category of framed cospans over SGraph T , we run into a problem. For the 
identity cospans X — > Idx < — X, the string graph Idx contains isolated wire-vertices, so they are 
not framed cospans. In other words, the "category" of framed cospans has no identities! However, 
there are cospans that come quite close to identity maps. 

Example 5.3.7. Let X be a framed point graph: 

V ABC 

X := • • • 



The pseudo-identity lx is defined as: 



X 



X 



I I 

B »C 



>B «C 



These are called pseudo-identities because composing with them yields a string graph that is 
wire-homeomorphic to the original graph. 




i 


i 






- • A 


-•B 


,a 


• 


J 


I 




! 


?■' 


? B 


; 






1 




1 A 


• G 


-»A 


-• 


: 


: 







In order to obtain honest identities from pseudo-identities, we shall define a category of framed 
cospans modulo a rewrite system, called a rewrite category. 

5.4 Rewriting on Cospans and Rewrite Categories 

Rewrite categories are categories of framed cospans, modulo a rewrite system. For these to be 
well-defined, we need to define a notion of rewriting on cospans and show that rewriting cospans 
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is compatible with cospan composition. In particular, we show for \G\ an equivalence class of 
cospans over a given rewrite system, we can define | H \ o | G \ : = |HoG|ina way that does not 
depend on the choice of representatives G and H. 

d c d' c' 

Definition 5.4.1. Let X — > G < — Y and X — > H < — Y be cospans in SGraph r , and let L — > R 

be a string graph rewrite rule. For a matching m, the following rewrite: 



m 



t 
G 



B 



G' 



R 



r + 



is called a cospan rewrite if the maps z'i and i - 2 are cospan homomorphisms. That is, there exist maps 
d, c such that the following diagram commutes: 




X 



G' 



*- It ■ -«- 



Y 





H 



We can show that any rewrite on string graphs lifts to a cospan rewrite, for unique maps d, c. 
This result relies on the fact that morphisms in a framed cospan factor uniquely through the S- 
pushout complements associated with a string graph rewrite. 



Lemma 5.4.2. Let L < — I — > Rbe a string graph rewrite rule, X — > G < — 
m : L — > G an S-matching on G. Then, for the associated S-pushout complement: 

h 

I L 



Y a framed cospan, and 



G' 



r 



the cospan maps d and c factor uniquely through i. That is, there exists unique d,csuch that: 

d 



X 



G' * 




Y 
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Proof. G' is the subgraph of G that has the interior of L removed and i is the inclusion of G' in G. 
For any vertex v in the interior of L, v cannot be in Bound (L) by definition of string graph rewrite 
rule. Thus, there exists no graph homomorphism m such that m[v) 6 Bound(G). By the definition 
of framed cospan, the image of d is contained in Bound (G), so for all v' 6 Vx, d(v') 6 Vqi. Letting 
d(v') = d(v'), we have i o a = d. The existence of c follows similarly. Uniqueness is automatic, 
since z is a monomorphism. D 

Corollary 5.4.3. A string graph rewrite G — > H lifts uniquely to a rewrite of framed cospans: 



X 



Y 



X 



H 



Y 



Notation 5.4.4. For a string graph rewrite system 1Z and a cospan X 
for the set of cospans G' such that: 



Y, we write \G\-r 



X^G^Y <- 



■ n X^G'^Y 



We drop the subscript 1Z when it is clear from context. 

Theorem 5.4.5. Rewriting commutes with composition. Let the following diagram define a composition of 
framed cospans: 

Y 



(5.2) 




HoG 

For a string graph rewrite rule L — > _R and an S-matching m : L - 
i\m : L — > H o G is an S-matching and: 

Ho (G[L-> R] m ) £ (Ho G)[L-> R] hm 

Similarly, for n : L — >■ H, h n '■ L — > H o G is an S-matching and: 

(H[L^> R}„) o G = (H o G)[L-> R] hn 



G, the composed morphism 



(5.3) 



(5.4) 



Proof. i\ is a monomorphism because the pushout in | |5.2| is a plugging, so i\m is a monomorphism, 
hence an S-matching by Theorem 5.2.3 By Lemma |5.4.2 the map c\ factors through the pushout 
complement G —\, x ,m L as in diagram |4.9|: 
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G *-_ 


\Ci 






i. 










C\ 




do 


b\,m *-• 




- Y - 


K 



G\L- 



R], 



All three pairs {c\,d), (ci,d), and (c[,d) are pluggings, so they are S-spans. Isomorphism 1 5.3 1 



follows from Theorem 4.4.10 Isomorphism ||5.4} follows similarly. 



□ 



Corollary 5.4.6. For a string graph rewrite system 1Z, composition given by \H\-% o \G\n := \H o G\n is 
well defined, and does not depend on the choices of G and H. 



Proof. If G' 6 \G\-ji and H' E \G\%, then by Theorem 5.4.5 we can always find matchings to rewrite 
H' o G' into H o G, so H' o G' E \H o G\ K . □ 

Theorem 5.4.7. For a rewrite system 1Z containing the wire-homeomorphism rules H, FCsp(7\l, T) is a 
category where: 

• objects are framed point graphs, 

• arrows are equivalence classes \G\-% of framed cospans, 

• identities are defined by pseudo-identity cospans: \tx\il> an d 

• composition is given by: \H\n o \G\n := \H o G\%. 

Proof. Composition is associative, because it is defined by pushouts as in cospan categories, so it 
remains to show that the pseudo-identity string graphs yield genuine identities in FCsp(7t!., T). For 
a framed cospan G, consider the composition: 




C2 



Y 



lyoG 

ly o G contains a copy of G, as well as some extra edges and wire-vertices. For every wire-vertex 
v 6 W(ly o G) — W(G), there exists a unique edge e 6 Ei Y oG — ^G adjacent to that vertex. We can 
always find a rewrite rule in H C 72. that produces a string graph isomorphic to ly o G with v and 
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e removed. Since all edges e 6 Ei Y oG ~~ Eg arise in this way, we can repeat the process until the 
resultant string graph is isomorphic to G. So |ly o G\n = \G\-r. Similarly, \G o lx\n — \G\fc. D 

We often want to consider the full subcategory of FCsp(72., T) containing only equivalence 
classes of "downward-directed" cospans. This category is called FCsp + (1Z, T). 

Definition 5.4.8. FCsp + (7?., T) be the full subcategory of FCsp(TZ,T) whose objects are positive 
framed point graphs. 

Theorem 5.4.9. FCsp + (7?., T) is a symmetric traced category and FCsp(7?., T) is a compact closed cate- 
gory- 
Proof. First, we show that FCsp(7?., T) is a compact closed category. For any framed graphs A, B, 
A (g> B has vertices Va + Vg which can be given a total order by placing all of the elements in Vg 
above those in V& (i.e. the usual disjoint union of totally ordered sets). The monoid product of 
framed cospans is given coproducts in SGraph T . For cospans: 



A^hG 



C 



B ^ Hi ^c 



the underlying string graphs of A ® B and C <g> D are coproducts A + B and C + D, so there is an 
induced cospan over G ® H := G + H: 



A L 



ci 



A®B ' B 

1 \d\,d^\ &i 

t \ 



G®U 

I 



' H 

j 

i [Cl,C 2 J c 2 

C(8)D 3 D 



It can easily be verified that this induced cospan is framed. The rest of the structure maps are 
analogous to those from string diagrams: 



T T 



°~A,B ■-- 




»A *A 

1 1 

-mA +mA 



CA 



+»A -mA 

T T 

• A 9A 



1 1 

+mA +mB 

All of the axioms of a compact closed category then follow from string graph isomorphism and 
edge homeomorphism. For example: 
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T 



A <g>l A )o (l A ®d A ) 



i 


1 


i 


-•.'1 


-•,■1 


-•. 


T 


I 


T 



1 



1 



r 

i: 



i 



Since any full subcategory of a compact closed category is a symmetric traced category FCsp + (1Z, T) 
is a symmetric traced category with the trace operation defined using the compact structure from 

FCsp(ft,T). a 

5.5 Free Monoidal Categories 

In this section, we will prove that the category FCsp + (H, T) is the free symmetric trace category on 
a monoidal signature T. Free categories are special in that they are sound and complete with respect 
to the axioms of that type of category. That is, we shall prove that two morphisms are equal by the 
axioms of a symmetric traced category if and only if they are equal in FCsp + (H, T). Once we prove 
this, it is a simple matter to show that FCsp(lH, T) is the free compact closed category on a monoidal 
signature, by proving that the category FCsp(H, T) is equivalent to the result of performing the 
"Int" construction [34J on the free symmetric traced category FCsp + (H, T). For simplicity, we will 
focus on strict categories in this section. 

Before we get to the bulk of the proof, we introduce some notation. The first thing we introduce 
is the notion of indexing a morphism. 

Definition 5.5.1. For a small, strict monoidal category V, fix a set of atomic objects O, such that any 
object in V is a monoidal product of elements of O. For an object X 6 ob V, an X-word is a monoidal 
product X,j ® . . . (8 X; M = X such that all \ are distinct and X, E O. 

We will assume that O contains "enough copies" of every atomic object to find X-words for 
every object X. Replacing X with an X-word is simply the act of binding each position in the 
monoidal product to a unique index that we can refer to later. 

Definition 5.5.2. For a morphism f : X — >■ Y, an indexing of f is a choice of an X-word and a Y-word 
such that 



/ = /':*, 



'X* M ->*A 



Y, 



In 



A morphism from an X-word to a Y-word for any X, Y is called indexed. 
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For an X-word X^ <g> . . . <8> X, M , and an index i E {i\, . . . , ?m}/ CX:/ is the (unique) symmetry map 
that permutes the object X, to the end of the list and leaves the other objects fixed. 

x h 

VX:i = 

X h X 'm X < 

In any strict symmetric traced category we can define a contraction operator CA— ) which 
"traces together" the f-th input with the j-th output on an indexed morphism. 

Definition 5.5.3. Let / : X ;i <8> . . . <8> X, M — > Yj <g> . . . <g> Y, N be an indexed morphism in a symmetric 
traced category such that for indices i 6 {z'i, . . . , z'm} and / 6 {/i, • • • ,/n}/ X, = Yy. Then we define 
the contraction C|(/) as follows: 

C{(/):=Tr x -V Y:; -o/ocr-)) 




Note that a contraction of an indexed morphism yields an indexed morphism, so we can con- 
tract many times. Also, the resulting morphism does not depend on the order in which we perform 
contractions. 

Lemma 5.5.4. Contractions are commutative. For an indexed morphism f distinct indices i, i' and j,f: 

ci(c(,(f)) = c j : l (ci(f)) 

Proof. Let X = X^ <g> . . . <g> X, M and Y = Y ;i <g> . . . <g> Y /N , let i, i' £ {z'i, . . . , z'm} be distinct, and let 
j,f 6 {]!, . . . ,]n} be distinct. 

C>(C(, (/)) = Tr X '= Y / ((r Y:; - o Tr X '"= Y / (<7 Y:/ o/o^)o cr"*) 

= Tr x '= Y i (Tr X ' ,=Y / ((<r y:j ® ly., ) o <r Y:f o / o cr"], o (^ ® l x ., ))) 
= Tr W >'= W ((cr Y:; - ® ly., ) o o- Y:f o / o ^ o (^ ® 1 X; , )) 

= (*) 

Let X' be a new X-word equal to X with the factors X, and X,/ deleted. Let Y' be a Y-word equal 
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to Y with the factors at Y, and Y,/ omitted. 

(*) = Tr X '^' = W ((a Y:j ® ly., ) o cr Y:; v o / o c^, o (c^ ® l x ., ) o (l x , ® l x . ® l x . ; ) 

= Tr Xi0X "'' =Y ^ Y /' ((cr Y:j ® ly., ) o cr Y:/ o / o <t£J, o (^ ® l x ., ) o (l x , ® (^ o cr x . x .,))) 

= Tr X <' Mi= W((ly, ® cr x . x.,) o (^ ® 1 Y .,) o cr Y:/ o/ o jr"* o (cr"* ® l x .,) o (l x , ® cr^ x .,)) 

= Tr X '" 0X;=Y ^ Y '((l Y , ® CTy.y.,) O ((Ty :; - ® ly.,) O <7y :/ o/ O ^ o (^ ® l x .,) O (l x , ® (T^ x .,)) 

= (*) 

Now, we need to have a look at the symmetry maps. By the coherence theorem for symmetric 
monoidal categories, a composition of symmetry maps is uniquely defined by the permutation it 
performs. First note that (ly/ ® c Y /Y ., ) ° {oy-.j <8> ly., ) sends Yj to the end of the list, so: 

(ly, ® Py //Y ., ) O (<7 Y .j ® ly-., ) = CTy.y 

Note that the cry., on the LHS refers to a different map than the one on the RHS, as it has a 
different domain and codomain. By naturality: 

Cy; O Cy.;/ = (Cy.;/ ® ly.) O Cy.; 

It can be shown similarly that: 

a X-i> ° i^XH ® !x,, ) o (1 X < ® cr^ x ., ) = cr-1 o c^ = c^ o (c^, ® l x .) 

Substituting in to the above expression, we complete the proof: 

(*) = Tr X '" 0X;= W((cr y:/ , ® l y .) o c Y:/ - o/o cr-l o (c"], ® l x ,)) 
= Tr X -' =Y /' (Tr x >= Y ; ((c Y:; , ® 1 Y .) o c Y:; o / o og. o (c^, ® l x .))) 
= Tr X ' 7=Y /' (c Y:/ v o Tr x '= Y i (c Y:/ o / o a x ]o) o c x J,) 
= 4(C|(/)) 

n 

Definition 5.5.5. For a (small, strict) traced symmetric category V, define a set M of atomic mor- 
phisms, such that any morphism in V can be obtained from those morphisms and the traced sym- 
metric structure. An indexed morphism is called disconnected if it is of the form f — f\ ® . . . ® / x , 
where each fa is an indexed morphism in M: 

Definition 5.5.6. Let / = /i ® . . . ® /)vr be a disconnected indexed map. For distinct indices 
{h,...,ip} C {z' 1/l7 ...z' K/Mj J and {/!,...,/>} C {;' X/1/ .. .]k,n k }, a map /' is said to be in contrac- 
tion normal form (CNF) if: 

/' = c£(cg(...(c£ (/))...)) 
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Definition 5.5.7. Let / and f be given as in Definition 5.5.6 A component f k of f is said to be totally 
contracted if the indices of all of its inputs occur in \i\, . . . , ip} and the indices of all of its outputs 
occur in {;'!, ...,j P }. 



Lemma 5.5.8. Let f and f be given as in Definition 5.5.6 By re-indexing the contraction, we can reorder 
the totally contracted components of f arbitrarily. 

Proof. It suffices to show that we can send any totally-contracted f k to the far right side of / by 
re-indexing the contraction. 



Cg(c£(...(c£(/i 



jp i 



f k ®...®f K ))...)) = C' i }{Cf{...{C' i f{f 1 ®...®f K ®f k ))...)) 
h h 'p 



We can show this by applying the previous lemma and naturality of symmetries. First, write f 

as/ = f L ® f k ®f R (ovf L : X L -> Y L ,f R : X R -> Y R , and f k : X k -> Y k . 

c j i l(...(df(f L ®f k ®f R )...)) = (*) 

We then pre-compose with the indentity (i.e. a swap map and its inverse). Since f k and all the 
maps after it are totally-contracted, we can eliminate one of the swap maps by re-indexing the i's. 
Similarly, we can introduce a swap after f by re-indexing the j's. 

(*)=C j ^...(C j f((f L ®f k ®f R )o(X L ®a xRiXk )o(X L ®o- xkrXR ))...)) 
= dj (. . . (C| ((/ L ®f k ® f R ) o (X L ® a xR>xk )) ...)) 
= C'h...(ch(Y L ®o- Yk y R )o(f L ®f k ®f R )o(X L ®o- xR ,))...)) = (*) 



An application of naturality completes the proof: 

(*) = d; (. . . (C'^((f L ® f R ® f k ) o (X L ® o- xk)XR ) o (X L ® a xR/Xk )) . . .)) 

= c i }(...(c i /(f L ®f R ®f k )...)) 

h 'p 

We can therefore send f k to an arbitrary position by performing this procedure in reverse. □ 

A totally contracted identity map whose input is connected to its output is called a minimal 
circle. 



Lemma 5.5.9. Let f,f be Defined as in 



5.5.6 



Vfk 



1 



x -w 



ly. is a totally contracted identity map 



that is not a minimal circle, then it can be removed by re-indexing. 



Proof. From Lemma 5.5.8 we can assume the totally contracted identity map is on the far right. 



Let M be the index of the input and N be the index of the output. By 5.5.4 we can move the two 
contractions involving the identity map all the way to the inside, so /' is of the form: 

f = c(c(...(d M (c?(f"®i XM )))...)) 
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We can reduce the inner map using the definition of Uy., and the trace axioms. 

d M {Cf{f" ® lxj) = Tr x -= V Y:; o Tr x <= Y ~((f ® 1 Xm ) o cr"])) 

= Tr x *= Vy:/ o Tr x <= Y ~((f ® 1 Xm ) o (^ ® 1 X J o (l x , ® cr^))) 
= Tr x M=v 7((rr: . T r x - Y "(((/" o ^) ® 1 X J o (l x , ® cr^))) 

= Tr^^^.o/'o^ oTr x '=^((lx' ®< /X ,))) 
= Tr x M= Y ,(cr r:i o/" o a x \ o (l x , ® Tr x <= Y ~(^ X; ))) 

= Tr X «= Y /((7V : y o/" o <r- : ] o (l x , ® l XM=Xi )) 
= lr X - Y /K :; o / "oc7 X: l)=c|(r) 

n 

Theorem 5.5.10. FCsp + (H, T) is the free strict symmetric traced category on the monoidal signature T. 

Proof. Let T = (O, M, dom, cod) be a monoidal signature, V be a strict symmetric traced category, 
and F : T — > V be a monoidal signature homomorphism. There is an evident signature homo- 
morphism E : T —> FCsp + (HT, T), taking each object Z £ O to the framed point graph consisting 
of a single wire-vertex of type Z and taking each g 6 M to a homeomorphism-class of cospans 
consisting only of a box-vertex of type g and its inputs and outputs. 

Let X be a framed, positive point graph, where Vx = {pi < P2 < • • • < Pn}- Then, for all p„ 
F(r x (p,')) is an object in V. Let: 

F(X)=F(T X (pi))®...®F(T X (p N )) 

This map is uniquely specified, and it respects the monoidal product on objects in FCsp + (H, T). 
Let |G| : X — > Y be an arrow in FCsp + (H, T), represented by a cospan X — > G < — Y. Let 
{x-[ < X2 < ... < Xm} be set of wire-vertices in G that are in the image of d, inheriting the total 
ordering from X. Let {1/1 <yi< ... < y^} be the same for Y. Define a function F' from Vq to the 
morphisms of V as follows: 

. j F(tg (p) ) if P is a box-vertex 

F {v) = < 

I 1f(t g (d)) if p is a wire-vertex 

Let {zi, . . . ,zx} C Vq be the set of vertices not in the image of d or c. Define a disconnected 
indexed morphism in V. 

/ = F'{x x ) ® . . . ® F'(x M ) ® F'(yi) ® . . . ® F'(y N ) ® F'(z x ) ® . . . ® F'(z K ) 

We can form a CNF term from \G\ by adding a contraction CA— ) for every edge e in G. We 
define the indices for these contractions as follows. If Tc(e) = rnidz, then F'(s(e)) and F'(i(e)) are 
both identity maps. Let i be the unique input of F'(t(e)) and j is the unique output of F'(s(e)). If 
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Tq (e) = in ? / c , then F' (s (e) ) is an identity map. Let / be the unique output of F' (s (e) ) and let i be the 
k-th input of F'(£(e)). If Tg(c) = out^, then F' (t{e)) is an identity map. Let z be the unique input 
of F'(t(e)) and let; be the fc-th output of F'(s(e)). Define F(|G|) as: 



F(\G\ 



Cf(C?L..{Cf(h®...®f k ®...®f K ))...)) 



For this to be well-defined, we need to show that this does not depend on the choice of G. Choosing 
an isomorphic string graph cospan amounts to picking a different order for the internal vertices z, 
and the contractions CA-). Any internal vertices in G will be totally contracted in F(|G|), so 
by Lemmas 5.5.4 and 5.5.8 choosing an isomorphic cospan will not affect the value of F(|G|). 



Choosing a homeomorphic cospan amounts to varying the number of totally contracted identities 
that are not minimal circles. By Lemma 5.5.9 this does not affect the value of F(|G|) either. 



It is straightforward to show that F takes identities to identities and respects composition and 
traces, so F is a symmetric traced functor from FCsp + (H, F) to V. Thus F satisfies the universal 
property of the free symmetric traced category. 

/: FCsp + (H,F) 




Since it is possible to build any positive framed cospan using the cospans in the image of £ and 
the traced symmetric structure, F is the unique map making this diagram commute. □ 

Theorem 5.5.11. There is a symmetric monoidal equivalence of categories Int(FCsp + (H, F) ) = FCsp(H, F), 
such that the following diagram commutes: 



FCsp+(H,F) c Int(FCsp+(H,F)) 



FCsp(H,F) 



(5.5) 



Proof. The category Int(FCsp + (HT, F)) has as objects pairs of positive framed point graphs {A, X) 

and as arrows H-equivalence classes of cospans. An arrow \G\ : (A, X) — »■ (B,Y) is represented 

d c 

by a cospan A + Y — > G < — B + X. Composition is done by composing on the first object and 

"composing backwards" via the trace on the second object. 
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HZ 




Y 






1 


/■ L 








v x 


nZ 




' Z 


• X 




Y 






1 




NX 





H 



Since the domain and codomain are disjoint unions of framed point graphs, we can decompose 
the map d into maps d\ : A — » G and di : Y ^ G and the map c into maps c\ : B — » G and 

c 2 : X -> G. 

We can then form a new cospan by interchanging X and Y and flipping their sign maps. Since 
&l and c-i are just string graph homomorphisms, we can consider them to have domains X* and Y* , 
respectively. Thus we obtain a cospan representing an arrow in FCsp(lH, T). 



A + X *^ G ^ B + Y * 



(5.6) 



Let F((A,X)) = A + X* and let F(|G|) be the H-equivalence class of cospans represented by 
5.6}. We can show that F respects composition: 




^ 



< 


Y 


i 


X 

1 




■ 


1 
tX 


' 1 








\tj 




^y 


t 

X 


I 






1 \ 




"~~-i 


z 

1 


> 




1 


»z 


Y 


' 


1 

z 




It can also be verified that F preserves all of the traced symmetric structure, up to isomorphism. 
Flipping X and Y induces a bijection of hom-sets: 

hom FCs P + (H,T) (A + Y, B + X) ^ hom FCsp(H/T) ( A + X* , B + Y* ) 

so F is full and faithful. For all positive framed point graphs A, X, objects A + X* are in the image 
of F. For an arbitrary object Z in FCsp(H, T), we can define a symmetry isomorphism that sends 
all the positive points to the left and all the negative points to the right. 
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So F is essentially surjective. Therefore there is a traced symmetric equivalence of categories 
Int(FCsp + (H,F)) S FCsp(H,F). FCsp+(H,F) fully embeds into Int(FCsp+(H,T)) as objects 

d c 

(A,0) and cospans A + — > G < — B + 0. The functor F acts trivially on these cospans, so 



diagram 1 5.5 1 commutes. □ 



Corollary 5.5.12. FCsp(H, T) is the free compact closed category on a monoidal signature T. 

Proof. The embedding of T in FCsp(H, F) factors through FCsp + (H, F). The universal property 
follows from composing free constructions. 




F is a monoidal signature homomorphism, F is the unique traced symmetric functor induced by F, 
and F is the unique compact closed functor induced by F. □ 

A consequence of this construction, is we can quite easily build "free categories containing an 
X" (e.g. the traced symmetric and compact closed analogues of Lawvere theories, PROPs, etc.) as 
rewrite categories. For an algebraic theory £ , we can translate all of the equations in £ into rewrite 
rules, forming a rewrite system 1Z. Applying the axioms of an algebraic structure to a morphism in 
a monoidal category corresponds precisely to rewriting with the rules in 1Z. Purely by construction, 
a category of the form FCsp(7\l, F) satisfies the axioms of £ , and nothing else. We can formalise this 
using another unique factorisation property. 

For a rewrite rule L — > R E 1Z, the string graphs L and R (regarded as cospans over their 
inputs and outputs) define morphisms |F|h/ |^|h G h° m FCsp(H,r) (%■> ^0 m the rree compact closed 
category over F. 

Theorem 5.5.13. Let TZbea string graph rewrite system containing H. Let F : T — >■ V be a valuation of a 
monoidal signature T such that, for the functor F : FCsp(H, F) — > V and every rewrite rule L — > R £ 1Z, 
F(|L|n) = F(|R|]h)/ the valuation F factors uniquely as: 
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Proof. Note that for any rewrite system 1Z D H, then < — >ji D < — >jj, so we can think of the 
morphisms of FCsp(7£, T) as equivalence classes of morphisms in FCsp(H, T), where the quotient 
functor Q is identity-on-objects, and takes an equivalence class |G|u to the unique (possibly larger) 



equivalence class \Hl\h 2 |G|h- This is a well-defined functor by Theorem 5.4.5 and it is possible 
to define Eji := Q o E. Thus is suffices to show that F factors uniquely through Q: 

F 




FCsp(H,T) FCsp(ft,T) 

For existence, let Fr.(X) : = F(X) on objects. On arrows, let F-n(\G\-n) : = F(|G|h)- For this 
to be well-defined, it must not depend on the choice of representative G 6 \G\n. First, assume 
that G — r>ji G' for some G'. Then, there exists a rule L — > R E TZ and a matching m : L — s> G 
that rewrites G into G'. By deforming G and G', we can always find edge-homeomorphic cospans 
H e |G| H and H' E |G'| H such that: 

H = H out o (L <g> H side ) o Hh, and H' = H ou t o (R® H side ) o H in 

where ® and o are tensor and composition of framed cospans. Since | • |h respects tensor and 
composition: 

|G|h = | H ut | h ° (|-L|h ® |H S idelH) ° |Hjn|H and |G'|h = |H 0U t|iH ° (|-R|h ® |H S id e |H) ° iHmle- 
Then, since F is a strict monoidal functor, we have that: 

F(|G|„) = F(|H ut| H ) o (F(|L| H ) ® F(|H side | H )) oF(|H in | H ) = 
F(|H 0U t| H ) o (F(|R|„) ® F(|H side | H )) o F(|H fa |„) = F(|G'| H ) 

Since |G|tj represents the closure of G with respect to — >7j, this argument can be iterated to 
show that for any G, G' E \G\%, F(|G|h) = F(|G'|h)- Thus F K is well-defined. Regarding the 
hom-sets of FCsp(72., T) is quotients of hom-sets in FCsp(H, T), Fji and is unique functor induced 
by the universal property of quotients in Set. □ 

The exact same argument carries through replacing FCsp with FCsp + everywhere. Thus, rewrite 
categories form the most general compact closed (or symmetric traced) categories containing the 
equivalence closure of a rewrite system. They form the formal basis for reasoning about all of 
graphical theories we introduce in the next part. 
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Part II 



Entanglement, Graphically 



113 



Chapter 6 

Quantum Information and 
Entanglement 



Quantum information theory is the study of how data can be encoded and manipulated using mi- 
croscopic systems subject to quantum effects. Over the past two decades, it has grown into a large 
and diverse field, with applications in security, foundations of physics, and perhaps most notably 
quantum computing. In this chapter, we introduce the basics of quantum mechanics, quantum 
information theory, and models of quantum computing. 

6.1 Quantum Mechanics 

This section is provided for the non-physicist to briefly introduce the basic concepts of quantum 
mechanics used in this dissertation. Those familiar with QM can safely skip it. 

Quantum mechanics is a strange but very successful theory of the universe at small scales. Its 
Hilbert-space formulation has four key components, which we shall focus on in detail: 

1. States encode all of the information about a quantum system. These are represented as nor- 
malised vectors in a (complex) Hilbert space. 

2. Observables give us "questions" to ask about a quantum system, and provide the mathemat- 
ical means to turn a state into a probability distribution over measurement outcomes. These 
are given as self-adjoint operators on a Hilbert space. 

3. Dynamics describe how a state evolves over time. These are expressed as unitary operators. 

4. Compound systems are expressed as tensor products of simpler systems. 

To describe these components, we use Dirac's bra-ket notation. Recall that for any vector v in a 
Hilbert space "H, there is a natural way to get a linear map (p v : H — > C (i.e. a vector in the dual 
space %*), using the inner product defined on "H: 

<p v (u) = (v\u) 
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Vectors in the dual space of % are used so often that we employ the following notational trick. 
We write a vector v 6 H as a ket \v) E H, and we write the associated linear map <p v as a bra (v\. 
(v\ is a function from % to C, so we can apply it to a vector \u) E T-L. Then, some notational magic 
happens: 

(v\ \u) = (v\u) 

For this reason, we refer to the inner product as a bra-ket. We define an operation ( — ) + taking 
bras to kets and vice-versa. 

(\v)) f = (v\ and ((^|) + = \v) 

This operation naturally extends to linear maps: 

L f \u) = ((«|L) + 

Fixing an orthonormal basis, with respect to ( — |— }, we can represent \v) as a column vector 
and L as a matrix. Then, ( — ) + just becomes the conjugate-transpose: 



V 2 J v ' V fl 21 «22/ V fl 12 fl 22 

( — ) + extends to a contravariant functor t : Hilb op — » Hilb, giving (Hilb, ®,C) the structure of 
a t-monoidal category. 

A vector \ip) 6 T-L is the same thing as a linear map C — > "H sending 1 £ C to |i/>). We use 
these two notions interchangeably. As string diagrams, we represent kets as triangles with a single 
out-edge and bras as triangles with a single in-edge. 



i*> 



f <♦! = i 



In quantum mechanics, pure states are unit vectors in a Hilbert space W. Crucially, the fact that 
T-L is a vector space gives us a way to super-impose several quantum states to form a new state. 

l£>=E«*lfc> 

I 

In this case, we say that the state |£) is in a superposition of the states \ipi). 

If | j/>) is a state, then (tp\ can be thought of as a function that measures the extent to which some 
given state is \ip). This is the essential content of the Born rule, which provides a method for turning 
a state and an observable into a probability distribution on measurement outcomes. An observable 
O is a self-adjoint (O = + ) operator from a Hilbert space M. to itself. If 1-L is finite-dimensional 
then all self-adjoint operators diagonalise: 

O = Y^Ki\Vi)(Vi\ 
i 

We can choose «,- such that each of the eigenvectors \xj) are normalised, in which case we call them 
the eigenstates of O. We can therefore interpret O as a set {«,} of measurement outcomes and a set 
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{|l>j)} of possible outcome states. Picture an experimental setup, where we have some quantum 
state \ip) in a box and a measuring apparatus hooked to it. We dial in O as the thing we want to 
measure and push a button. Suppose for simplicity that O only has two possible outcomes: 

= l|l7 1 >(l7 1 |+2|l»2><l'2| 

When we push the button, the screen says 2. Thus, we know the second measurement outcome 
occurred. In that sense, the eigenvalue a,- should be thought of as a "marker" for the z'-th mea- 
surement outcome. The most informative observables are the ones where all of these markers are 
distinct and non-zero. These are called non-degenerate observables. Three non-degenerate observ- 
ables that play a particularly important role in quantum information and quantum computing are 
the Pauli spin operators: 

*-(!!) -(-■») z =G-°0 

Furthermore, upon getting outcome 2, we know that the state | ip) must be | V2) ■ Suppose before- 
hand that we had prepared \ip) in some superposition of \v\) and \V2)- As soon as we measured 
\ip), this superposition collapsed to a single state \l>2). This phenomenon is known as the collapse 
of the quantum state, or "collapse of the wavef unction". What this means physically is a ques- 
tion of interpretation, but mathematically it means after the measurement occurs, we can treat the 
quantum system as if it is in the state \V2). 

We compute the probability of getting outcome z using the Born rule. 

Frob(i,\ip)) = \(v t \ip)\ 2 = (^(v^) 

The key point here is that the Born rule is a function of the inner product. Because of the role it 
plays in measurement, we sometimes refer to elements of the dual space (i/?| € 7i* as effects. 

Since \ip) is normalised, the sum of the probabilities of all outcomes is 1, so Prob(z, \ip)) is a 
probability distribution. These probabilities provide our only access to the "real" quantum state, 
so we consider two states to be equal if they give the same probability distributions with respect to 
any observable. However, the Born rule yields the same probabilities for states | xp) and e' e | ip) . 

7°e w (ip\v i }(v i \ip}=e- ie e w (ip\v i }(v i \ip) = (v t \ip) (<l>\vi) 

The scalar factor e ,e is called a global phase. We always identify states (and hence operators) differing 
only by a global phase. 

The main reason the Pauli matrices are so interesting is that every distinct pair of them is com- 
plementary. Two observables are called complementary if their associated bases of eigenstates are 
equally-far apart. Let O, O' be observables in a D-dimensional Hilbert space with eigenstates { \vj)} 
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and { v'j)}. Then O and O' are called complementary if their bases are mutually unbiased. That is, 
they satisfy the following equation for all i,j: 



\(viWj)\ 2 = ^ (6-2) 

We can interpret this definition using the Born rule. If a state | xp) is in the z'-th eigenstate of O, 
then measuring O will obtain outcome i with certainty. So, if we know | ip) is in an eigenstate of O, 
we have maximal knowledge about the O observable. However, if we measure | ip) with respect 
to the O' observable, we are equally likely to get any outcome. So, maximal knowledge about O 
implies minimal knowledge about O'. 

Unlike measurements which cause a state to collapse, dynamic evolution of quantum states 
is always reversible. We can evolve a quantum state in time by applying a unitary (LT + = IT -1 ) 
operator. One should interpret the dagger of a unitary operator as that some operator, "done back- 
wards". To be a bit more explicit, suppose we represent the evolution of a state according to the 
Schrodinger equation, for a self-adjoint operator H called a Hamiltonian. 

ihj t m)) = um)) (6.3) 

If H does not depend on t (e.g. if the forces acting on a particle are constant), then the value of 
a solution at time t has a simple expression in terms of the value at some initial time 0. 

W0> =*-('/»)"* |?(o)> 

Letting U(t) = e -\ l /h) tH t then since H is self-adjoint, U(t) is unitary for all t and describes the 
evolution of \ip(0)) under H for time t. U( — t) then corresponds to the same evolution, but with 
time running backwards. Since U(t)U(—t) = U( — t)U(t) = LZ(0) = 1%, it must be the case that 

u{ty = u(-t). 

6.2 Compound Systems and Entanglement 

Suppose we have a particle in the state \ip) sitting in some potential well and another particle 
|<|>} sitting in another one far away. We use the tensor product to "pair up" the states of the two 
particles. That is, the overall state of the system is \ip) <S> \(p). But, since this is a quantum state, it 
could be in some superposition: 

-^ (lfc> ® \<Pi) + l«fc> ® \<h)) 

Suppose one state is in a superposition |i/>i) + |i/>2). Then the combined state is also in a super- 
position: 

(hh> + lfc» ® I'?) = (l^i) ® l^» + (l^2> ® !<?)) 
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This bilinearity justifies our use of the tensor product, since <8> provides the most general bilinear 
pairing for two spaces. Some states in the tensor product space H\ <g> H 2 can be written in the form 
\xp) <g> \<p) for \ip) £ M.\ and \tp) £ H 2 . Such states are called separable or product states. However the 
vast majority of states |Y) £ Hj <g> H 2 cannot be written this way. These are called entangled states. 

A measurement on one subsystem of an entangled state collapses the entire state, thus affecting 
the other subsystem. For instance, suppose we have some observable 

O = l|i7 X ) (vi\ +2\v 2 ) (v 2 \ 

as before, and an entangled state 

|Y) = -^{\v 1 )®\v 1 ) + \v 2 )®\v 2 )) 

and we measure O on the left subsystem, getting outcome 1. The whole state is now: 

\r) = \v 1 )(8)\v 1 ) 

If I Y) were a product state, then the second system would be unaffected, so entanglement is the 
crucial property that allows such correlations at a distance. 

Entanglement is also a source of computational complexity for many-body systems. For finite 
dimensional Hilbert spaces, the dimension of the tensor product of two spaces is the product of the 
dimensions of each space. 

dim (Hi <8> Hi) = dim (Hi) dim (H 2 ) 

So, the dimension of a compound system increases exponentially with the number of subsystems. 

dim( H® .®H )=dim{H) N 

N 

Computing with such states quickly becomes untenable, even for low-dimensional H, which is 
the main reason for trying to understand quantum phenomena like entanglement from a more 
structural level. 

6.3 Mixed State Quantum Mechanics 

Often it is more convenient to work with probabilistic mixtures of quantum states, rather than 
states that are totally determined. This is because nearly all procedures for preparing a quantum 
state in a lab only succeed with some probability. A set of quantum pure states along with asso- 
ciated probabilities {(|t/>j) , Pi)} is called an ensemble. We can compute the probability of getting a 
particular measurement outcome on an ensemble using the Born rule, adjusting for probabilities. 

Prob(f,{p ; , \iPj)}) =£p j \(v l \ip j )\ 2 = Epyfa#;Kty>.-> = (Vi\ ( EP; ItyXtyl J l"»> (6-4) 
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As in the case with pure states, we consider two ensembles to represent the same "state" if they 
produce identical probability distributions for all observables under the Born rule. From l |6.4} , we 
can see that two ensembles produce the same probability distributions if and only if: 

i i 

For normalised vectors \ipj) and YLifi = ^' * ms * s me general form for a trace-1 positive operator. 
For that reason, we call trace-1 positive operators mixed states. Because they encode probability 
densities, they are also sometimes called density matrices. Pure states can be represented as density 
matrices of the form | ip) (ip | . 

Just as states cannot in practice be prepared with certainty, so too is the case for quantum evo- 
lutions and measurements. The mixed version of a unitary evolution is a completely positive 
map (CPM). For a finite-dimensional Hilbert space M., let C(H) be the vector space of linear maps 
H — > ~H. CPMs are just linear maps <$> : £(H) — > C{W) that take positive operators to positive 
operators. 

Outcomes for pure measurements span an orthonormal basis. A particular observable O cor- 
responds to the decomposition of the identity into 1-dimensional projectors corresponding to each 
outcome for O. 

hi =L|t>i)(P*l 
i 

The probability of getting a particular measurement outcome on a mixed state can be computed 
as in 6.4 by tracing the composition of the projector and the state's density matrix. 

Prob(z',,o) = (v { \p \vj) = Tt((Vi\ p \Vi)) = Tv(p \Vi)(Vi\) 

For that reason, pure measurements are often referred to as projective measurements. The mixed 
version of a projective measurement is a positive operator-valued map (POVM). In finite dimensions, 
this is just a set of positive operators P; 6 £(Ji) that sum to the identity. As in the projective 
case, probabilities are computed by tracing the composition of the positive operator and the state's 
density matrix. 

Prob(z» =Tr(pPi) 

For the majority of this dissertation, we will only need concepts from pure-state quantum me- 
chanics. However, when we look at multipartite entanglement in chapter [8l it will occasionally 
be useful to ignore a subsystem of an entangled pure state. This can be done probabilistically by 
tracing out that subsystem and renormalising (if necessary). This is called the reduced density matrix 
of an entangled state. For a state |Y) 6 ~Hi ® H2 and p\2 '■= I 1 ?) (Y|, we can ignore the system TL2 
by tracing it out. 
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6.4 Quantum Computation 

Quantum computation refers to the encoding of data into quantum states and the use of evolution 
and measurements to perform computations on that data. As in programming language design, 
quantum computation can be carried out using one of a variety of paradigms. We shall introduce 
two of them here: the circuit model and measurement-based quantum computation (MBQC). 

For a D-dimensional Hilbert space, we often fix an orthonormal basis 1 0) , 1 1 ) , . . . , | D — 1} called 
the computational basis. The two-dimensional Hilbert space C 2 plays a special role in quantum 
computation, and is called the space of quantum bits, or qubits. The basis vectors |0) , |1) EC 2 can 
be thought of as classical bits, embedded in the bigger space of qubits. We also introduce a special 
notation for tensor products of basis vectors using bit strings: 

10010110} = |0) <g> |0) <g> |1) <g> |0) <g> |1) <g> |1) <S> |0) 

In C 2 the space of qubits, the basis |0) , |1) corresponds to the eigenbasis of the Z observable. 
We use the term "measuring in the computational basis" to mean performing a measurement with 
respect to Z. We also define the X basis | +} , 

I+>:=^(|0> + |1» 

|->:=^(|0>-|1» 

6.4.1 The Circuit Model 

In the circuit model, quantum computation proceeds in three steps: 

1. Prepare an N-qubit quantum state (usually a product state). Some of these qubits are treated 
as inputs and others simply as "helper" qubits called ancillas (which are usually initialised 

to |0}). 

2. Evolve the prepared state using small (usually 1- or 2-qubit), fixed-time evolutions called 
quantum gates. 

3. Measure some or all of the qubits, yielding the result of the computation. Unmeasured qubits 
are sometimes treated as outputs. 

Graphically, we represent a circuit evaluation as a string diagram. 



and the Y basis 


\i)A 


—i) as 


follows. 


\i) := 


1 

7! 


(|0) + 


»|1» 


H>== 


i 

V2 


(|o>- 


»|1» 
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Hi 



ill 



u 3 



i r 



preparations 



> gates 



L/ 4 



U 5 



n\ rA rA rA rA 



measurements 



Usually measurements are performed over the computational basis. That is, we measure the 
Pauli Z observable given in equation ||6.1}. Using the Pauli operators as the Hamiltonian in the 



Schrodinger equation 16.3 I, we can produce unitary evolutions we call the phase gates. 



Xa 



-{i6/2)X 



Yfl 



-(«'9/2)Y 



-{ie/2)Z 



Up to a global phase, we can recover identities and the Pauli gates themselves as phase gates. 



Xq — Yo — Zq — 1 C 2 



Xr. 



X 



Yjx — Y Zjj- 



It is a well known fact that any single-qubit unitary can be constructed from phase gates as 
U = Z a XaZ T This is called the Euler decomposition of the unitary. An important single-qubit gate 
that is not a phase gate is the Hadamard gate H. 

1 



H 



n/2^—7t/2^-n/2 



- r 

H interchanges the eigenbasis |0), |1) of Z with the eigenbasis |+) , |— } of X. A simple conse- 
quence is that HZqH = Xg. Perhaps the most common 2-qubit gate is the controlled-NOT or CNOT 

gate. 

' |00) ^ |00) 

|01) H^ |01) 

|10) H^ |11) 

I in) -> |io) 

It is called the controlled-NOT gate because the first qubit controls whether the second qubit 
has a NOT (a.k.a. X) gate applied to it. A generalisation of CNOT gates are controlled-unitary 
gates, which conditionally apply a unitary to the second qubit. 



-® 



1 



U 

T 



|0)<g>|t/>) ^ \0)®\ip) 
\l)2)\ip)^ |1>®(U|^» 



These are examples of gates that can take product states to entangled states. As such, they 
are sometimes referred to as entangling gates. For example, applying a CNOT gate to the state 
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1+) ® |0) yields an entangled state 4s(|00) + |11}) called the Be// state. Examples of gates that are 
not entangling gates are tensor products of 1-qubit gates, such as X ® Z. These will always take 
product states to product states. 

Just as AND, OR, and NOT gates can be used to construct arbitrary classical circuits, we have a 
notion for a set of quantum gates being able to construct arbitrary quantum circuits. We say a set 
of gates is universal for quantum computation if any N-qubit unitary map can be constructed from 
compositions of those gates. 

Theorem 6.4.1 ([7]). The gates Zg, H, and CNOT are universal for quantum computation. 

Many important quantum algorithms, such as the quantum Fourier transform, Shor 's factoring 
algorithm, and Grover's search algorithm, can be presented in the circuit model. 

6.4.2 Measurement-based Quantum Computation 

Measurement based quantum computation (MBQC), which is sometimes called one-way quan- 
tum computation, provides a different, equally powerful paradigm for quantum computation. For 
comparison to the previous section, we can organise the MBQC procedure into three steps. 

1. Prepare a known, highly-entangled state called a graph state. This graph state may be entan- 
gled to some qubits in an unknown state called input qubits. 

2. Perform measurements at arbitrary angles, where the choice of angles can depend on previ- 
ous measurement outcomes. 

3. Optionally, perform single-qubit corrections on unmeasured, output qubits. 

Graph states are constructed by preparing a collection of (non-input) qubits in the |+) state, 
then applying controlled-Z gates to pairs of qubits to introduce entanglement. We represent such 
a state by drawing a vertex for every qubit and an edge whenever a controlled-Z gate is applied. 
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This representation is unambiguous because controlled-Z gates are symmetric and commute 
past each other. We define generalised measurements, or measurements with angles as follows. An 
Xg measurement on single qubit consists of first applying the unitary gate Xg then measuring the 
X observable. A Zg measurement consists of first applying the unitary gate Zg then measuring the 
Z observable. Since they are measurements, neither of these operations is deterministic. 

Each measurement has two possible outcomes: a "desired" outcome, and an "erroneous" out- 
come. The latter can be thought of as the quantum version of an "accidental bit flip" during the 
course of the computation. The key to MBQC is that if we choose our measurement angles wisely, 
we can correct these errors as we go. If there are no measurements left to perform, we complete the 
calculation by applying any remaining corrections as single-qubit unitaries on the output qubits. 
This is known as feeding forward corrections. There are several techniques for identifying and using 
graph states for deterministic MBQC, such as identifying a generalised flow for the graph |fl2). To 
give a feel for how feed-forward works, we provide a simple example. 

Example 6.4.2. Prepare the following graph state, where Cj\ and q$ are in an unknown state (i.e. 
they are inputs) and qi is prepared in the | +} state. 

<7l <?2 <?3 



First measure q^ in Z, getting outcome i G {0, 1}. Letting Zq = 1 C 2 and Zj = Z, apply Z, to q\ 
and apply Z;H to qi. Treating q\ and qi as outputs, this procedure computes the CNOT of q\ and q^, 
regardless of the outcome of the measurement of q$. As Z, depends on the outcome of measuring 
qj,, this is an example of feeding-forward a measurement outcome to a correction. 

This may seem like an excessively roundabout way to apply a CNOT gate, especially since a 
naive implementation involves applying 2 controlled-Z gates to prepare the graph state. However, 
if one assumes we have a stock of suitably nice graph states, we can actually perform arbitrary 
quantum computations using (comparatively easy) single qubit unitaries and measurements. 
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Chapter 7 

Categorical Quantum Mechanics 



Categorical quantum mechanics (CQM) refers to a broad program initiated by Abramsky and Co- 
ecke in 2004 |3| that emphasises the abstract, categorical, and compositional aspects of quantum 
mechanics. The core thesis is that the structure-rich setting of Hilbert spaces and linear opera- 
tors obscures the causes of many quantum phenomena. Therefore reasoning in the comparatively 
sparse context of an arbitrary monoidal category yields practical and foundational insights that 
were previously hidden. Depending on the aims of a particular project under the umbrella of 
CQM, this proscription of Hilbert spaces can be taken literally, as in Paquette's PhD thesis If57ll for 
example, in order to obtain structural results about quantum theory that are independent of Hilbert 
space formulation. This route could prove fruitful in light of doubts that the Hilbert space formal- 
ism is the "correct" way to think about quantum mechanics, as expressed in these now-infamous 
words of its progenitor. 

I would like to make a confession which may seem immoral: I do not believe absolutely in Hilbert 
space any more. [54, John von Neumann (1935)] 

Alternatively, one can take a less "hard-line" approach by using categorical techniques to com- 
plement and expand upon concrete results based on Hilbert spaces. We adopt this approach in the 
sections to come. 

This chapter offers an introduction to categorical quantum mechanics and a handful of illus- 
trative examples. We employ notions from CQM to show how complementary observables can 
be studied as interacting Frobenius algebras and offer several new results about special types of 
complementary observables called strongly complementary observables. Most notably, we give 
a classification theorem for strongly complementary pairs of observables and show that a set of 
pairwise strongly complementary observables must contain no more than 2 distinct observables. 
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7.1 Compact Categories and Teleportation 

Quantum protocols exploit the unique features of quantum mechanics (typically entanglement and 
superposition) to perform a task that would be difficult or impossible classically. The canonical 
example of a quantum protocol is quantum teleportation, whereby one party (called Alice) can 
transmit an arbitrary quantum state to a second party (called Bob) using only a shared entangled 
state and a classical data channel. To start, Alice has a quantum state | ip) and Alice and Bob share 
a Bell pair. That is, a pair of qubits in the state |00) + |11) (ignoring normalisation factors). 

Alice performs an entangled, 2-qubit measurement called a Bell measurement on \xp) and her half 
of the Bell pair. A Bell measurement consists of measuring the two qubits in the Bell basis: 

|Y ) = |00) + |11) |Yi) = |00) - |11) |Y 2 ) = |01) + |10) |Y 3 ) = |01) - |10) 

She gets an outcome i E {0, 1,2, 3}, which she then sends to Bob. Bob then applies a unitary 
correction to his half of the Bell pair, based on i: 



Un 



^ 



Ui 



U 7 = x 



Lh 



xz 



Once this is done, Bob's qubit will be in the state \ip), i.e. the state of \ip) has been teleported to 
Bob. We can represent this protocol in circuit language: 



Alice 



Bob 



Alice 



Bob 





This protocol works because, by performing a Bell basis measurement, Alice projects out her 
two qubits using the associated bra ( Y, | . 



Alice 



Bob 



Alice Bob 
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We can express all four elements of the Bell basis in terms of the Bell state and the corrections 
we defined beforeQ 

|Y,) = (1 ® U { ) |Y ) = (Uj ® 1) |Y ) (Ti| = (To | (1 !i, + ) = (T | (LT, 1) 

Thus, we can pull the LT, out of the measurement all the way to the end, and prove the telepor- 
tation protocol works for all i. 





(*) 



The crucial step is (*). This identity should look familiar. Teleportation, like many quantum 
protocols exploits the fact that FHilb is compact-closed and finite-dimensional Hilbert spaces are 
all self-dual (X S X*). 



Using this insight, one can perform teleportation in any self-dual compact-closed category, in- 
cluding Rel, Mat(R), and (perhaps surprisingly) Spek, the category Rob Spekkens' toy theory The 
last example is surprising, because Spek can be defined using a local hidden variable model. Thus 
teleportation succeeds even in the absence of non-locality for a physical theory. For more details, 

see nsma. 

7.2 Complementary Observables as Frobenius Algebras 

The eigenstates of an observable play a key role in quantum mechanics. They form the set of pos- 
sible outcome states one obtains by performing a measurement. Classical data is obtained from 
a quantum system via measurements, so an orthonormal basis of measurement outcomes can be 
thought of as a particular classical context embedded in the overall quantum state space. In study- 
ing the interaction of multiple classical contexts (especially complementary ones), we can see the 



1 Note that applying (I, to the right qubit of |Y) has the same affect as applying Uj on the left qubit because all of the 
maps Uj, written as matrices over the computational basis, have real entries. A more generalised scheme is provided in ] 3 1, 
replacing (— ) + with ( — ) O t . 
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unique features of quantum mechanics. The question is, can we study the concept of a "basis" over 
an object in an arbitrary t-compact category? 



Recall in example 3.2.24 we used a basis of a vector space to construct a special commutative 
Frobenius algebra. 

5 :: e, h-> e t ® e,- e :: e* h- >• 1 f< :: e; (g) e, i->- e; 77 :: ]Pej 

It was also noted that all SCFAs over an algebraically closed field are of this form. So, there is 
a one-to-one correlation between SCFAs and arbitrary bases. However, projective measurements 
like the ones we have described have outcomes in an orthonormal basis. Orthonormal bases can be 
captured in a t-compact closed category using t-special commutative Frobenius algebras. 

Definition 7.2.1. A i-special commutative Frobenius algebra, or t-SCFA, ( A, 6^ , e + , 5, e) is a t-Frobenius 
algebra such that 5^5 = \ A . 

In ETj Coecke, Pavlovic, and Vicary showed that t-SCFAs in FHilb are in one-to-one correspon- 
dence with orthonormal bases. So, for any orthonormal basis {\i)} in a finite-dimensional Hilbert 
space, there exists a unique t-SCFA whose comultiplication copies the basis vectors and whose 
counit deletes them. 

5 :: \i) >->• \ii) e :: \i) h-s> 1 

The basis vectors \i) are called the classical points of 5. This respects the no-cloning principal 
in quantum mechanics, because 5 cannot copy any arbitrary state, only those in {|i)}. In fact, one 
can prove that these are the only vectors copied by 5 and deleted by e, so a basis can always be 
recovered from a t-SCFA by taking the set of classical points. For an observables O and O', let the 
associated t-SCFAs be: 

&o ■= A e o ■= A So> ■- A e o> 



Then, we write classical points as triangles of the same colour, and their associated bras as 
upside-down triangles of that colour. 



Two observables are complementary if their bases of eigenstates are mutually unbiased. That 
is, for any i, j, \{vi\v'-)\ 2 = 1 / D. In the graphical notation: 
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1 

D 



A question posed by Coecke and Duncan [15| was, "Can we represent complementarity purely 
in terms of interacting Frobenius algebras?" It turns out that complementarity is equivalent to a 
simple diagrammatic identity between two t-SCFAs. First, we can move 1 / D in the above equation 
to the other side and express it as a circle, as the trace of the identity always equals D. Then, replace 
1 on the RHS with "deleted points". 



o 



J7 w 



(7.1) 



Frobenius algebras fix an isomorphism of a space with its dual space. In the case of t-SCFAs, this 

OT / , 

. Graphically: 



isomorphism takes a classical point to its adjoint: \V{) — (Vi\ and 




We can simplify the LHS of equation | |7.1) using this fact. 

A 




= O 




The equation (*) is due to two applications of the spider theorem to merge the grey and white 
vertices, leaving a single edge connecting grey to white. Plugging this back into equation \7.1) , we 
get: 



o 
\y V/ 

Since this equation holds for all i, j and the classical points span the entire space, we can con- 
clude that a more general identity holds: 



O 



? 



(7.2) 



Suppose we define a map S, serving as an antipode: 



S = 



(7.3) 
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Then, equation 1 7.2 1 resembles the equation found in Definition 3.1.4 of a Hopf algebra. 




o 



For that reason, we refer to \7.2\ as the Hopf law. 

Theorem 7.2.2. Two t-SCFAs correspond to complementary observables if and only if they satisfy the Hopf 
law. 

All complementary observables satisfy the Hopf law. When a complementary pair of observ- 
ables actually extends to a (scaled) Hopf algebra, we call them strongly complementary. 

Definition 7.2.3. Two observables O and O' are called strongly complementary if their associated 
t-SCFAs satisfy the following equations, called the scaled bialgebra equations. 



Note that we have only required that (A, }Iq, t/q, 5qi , sqi) be a bialgebra, up to scalar factors. 
However, we can show that bialgebras consisting of t-SCFAs automatically satisfy equation | |7.2| , 
so they are Hopf algebras. Before we can show this, we need a couple of lemmas. For the remainder 
of the section, let O = ( tj , "J, Q, 6 ) and O' = ( XJ , Q, Q, ^ ) be strongly complementary 
t-SCFAs in FHilb. 

Lemma 7.2.4. Up to a scalar, XT is a monoid over the classical points of Jh . For all i,j, the following are 
classical points for O': 



(ix/i 



o 



(7.4) 



Proof. We can show that the point labelled i ■ j is copied using the first bialgebra rule. 

AA AA 



lA/A /i\/l 




9 
o 



Deletion follows from the dagger of the second bialgabra rule. 

^ A AA 
A\/A 

= 1 




We can apply the bialgebra: 
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and 



9 
o 



Since the scalar <^ o 9. is non-zero, e is a classical point. □ 

This is the property that Coecke and Duncan refer to as closure. 
Lemma 7.2.5. For a strongly complementary pair of ' observables, ( tj ) ° T = JX an d ( 9 ) ° T = 6 • 
Proof. We can use the previous lemma to evaluate over classical points for tt . For multiplication: 





And for unit: 



O 



A simple consequence of this lemma is: 



v7 V 



n 



o 



Lemma 7.2.6. The antipode map S defined in figure {7.3* is self-adjoint and is an automorphism for both 
Frobenius algebras. 

Proof. For S to be self-adjoint it suffices to show we can interchange the caps and cups. 




To show S is a Frobenius algebra automorphism, we can use the previous identity and the fact 
that it is copied by 







□ 
Theorem 7.2.7. A strongly complementary pair of observables forms a scaled Hopf algebra with antipode S. 
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o 




Proof. The proof follows straightforwardly from the bialgebra identities and Lemmas 7.2.5 and 
EZ6] 





□ 

Using the results from this section, we can prove a stronger classification result than the ones 
given in 1 16 1 for strongly complementary observables in FHilb. 

Theorem 7.2.8. Let (G, ■, e) be a finite Abelian group of order D, and { \g) : g 6 G} be a D-dimensional 
orthonormal basis. Every strongly complementary pair 0,0' ofi-SCFAs is of the following form. 



&o ■■■\g)^ \g) ® \g) 
1 



eo ■■■ \g) H- 1 
n a ::1H y/D\e) 

Proof. First, we show this is indeed a strongly complementary pair. (So, £o) copies and deletes and 
orthonormal basis, so it extends to a t-SCFA. Also, up to a scalar, ("Ho 1 , Vo 1 , Ho" Vo' ) * s * ne m£ luced 
Frobenius algebra of the group algebra C [G] . It is a routine calculation to show that the factors of 
l/\/D and y/D cancel out where necessary in the monoid, comonoid, and Frobenius identities. We 
can give explicit forms for Sqi — \P QI and e , = rf Q ,. 

\ 

t0>----\g)^^jr E l«l)®l«2> 
V U gyg2=g 

To show specialness, evaluate u o S for any g E G: 



e , = VD(e\ 



Ho'^o' \g) = Vo> ( -t= E \8i) 



\g2 



D 



E Is) 



gVg2=g 



gVg2=g j 

Every element in G has exactly \G\ = D distinct factorisations (i.e. pairs (gh,h~^) for all h £ G), so 
V-O'^O' \g) ~ \g)- We can a l so compute the explicit form for the "cap" S > ° r\o<- 

dEI^> 

geG 

From this it follows that S \g) = jg^ 1 ) and (}io',vjo'>bO' e O,S) is (up to a scalar) the induced Hopf 
algebra of the group algebra C [G] . Therefore O and O' are strongly complementary. 



S irjo> = 5 , [Vd \e)\ = VD J] \g x ) <g> \g 2 
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Conversely, let ( xjJ , 9/ JX, 6) and (\f, 9, A, 6) be strongly complementary t-SCFAs. 
From Lemma 7.2.4 xf is a monoid over the classical points of JS . We can then evaluate both 

' over an arbitrary classical point. 



sides of the equation from Theorem 




Since S is a Frobenius algebra automorphism, it is a permutation of the classical points of jji . 
Thus the previous equation implies that for all i, there exists i' such that: 




In other words, all of the classical points of 
algebra C[G] for some Abelian group G. 



have inverses, so 



is isomorphic to the group 

□ 



For a D-dimensional space in FHilb, fixing J\ as the t-SCFA corresponding to the computa- 
tional basis B = {\i)}, we can find the basis inducing the group algebra C[Zq] corresponding to 
the order-D cyclic group by applying the D-dimensional Fourier transform Fjj to the elements of 
the computation basis: B' = {F D \i)}. Using the fact that C[G x G'] = C[G] <g> C[G'] for Abelian 
groups G, G', it is possible to compute the strongly complementary pair corresponding to any finite 
Abelian group G by decomposing it into its cyclic components. 

Example 7.2.9. In C 4 , let B be the computational basis. Then, we can compute the strongly comple- 
mentary basis B' = { |e;) } corresponding to Z2 x Z2 as follows. The 2D Fourier transform is just the 
Hadamard matrix H. So |e,-) = (H <g) H) \i). Writing these as column vectors in the compuational 
basis, we have: 

/T\ . f\\ . f}\ - f\\ 

B'= ' 



1 


1 


1 


-1 


1 


1 


1 


-1 


74 


1 


'74 


1 


' 74 


-1 


' 74 


-1 




w 




V-V 




V-V 




\i) 



In the case of complementary observables, it is often useful to know how big a complete set of 
mutually unbiased bases is for a given dimension. That is, a maximal set of bases such that is 
pairwise mutually unbiased. In the case of strongly complementary observables, there can only be 



two. 



Theorem 7.2.10. Let j\be a i-SCFA of dimension D > 2 and let ( A , i ) and ( j\, j\] 
strongly complementary pairs. Then Jk and Jjt cannot be strongly complementary. 



be 
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Proof. By contradiction. Suppose ( jk , J\ ) is a strongly complementary pair. The units J and 
9 must both be proportional to classical points of tx ■ We already showed that for any strongly 
complementary pair, 4 o <J = ^ o ^ = ^JD 7^ 0, so ^ and 9 must be proportional to the same 
classical point. Then: 




This is a contradiction because the LHS is invertible, while the RHS is rank 1 < D. 



a 



The classification of strongly complementary observables is much simpler than the general case. 
Whereas the maximum number of mutually unbiased bases of dimension 6 is still unknown, there 
is (up to isomorphism) one strongly complementary pair, corresponding to the cyclic group Zg = 
Z 2 x Z 3 . 

Phases have a special status for t-SCFAs in FHilb. Recall that phases are maps such that: 






From Proposition 3.2.15 we can put any phase in the form of the right multiplication by an 
arbitrary vector. Suppose a t-SCFA corresponds to a basis {|f)},thenp = J3|i) (ii\. For an arbitrary 
vector \ip) — Y^iXi \i), this is: 

/*(l®l*))=E«il*)(t| 

So, phases are precisely the maps that are diagonal in the basis defined by a t-SCFA. Unitary 
phases are precisely the phase gates familiar from quantum computing. 

For strongly complementary observables, the phases for ti associated with classical points of 
jS are Frobenius algebra automorphisms of Ct , up to a scalar. This follows from the bialgebra 
law. 



(7.5) 



7.3 The Z/X Calculus and Quantum Computation 

We have already seen that strongly complementary observables satisfy many graphical identities. 
This collection of identities is sometimes referred to as the calculus of complementary observables to 
emphasise that it can be used as a computational tool. We will now restrict our attention to the 
complementary pair Z and X, and show how we can use the Z / X-calculus to perform calculations 
on quantum circuits. 
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Definitions 7.3.1. Let Z = (C 2 ,Sz,£z) be the t-SCFA corresponding to the Z-observable and let 
X = (C 2 , Sx, ex) he the t-SCFA corresponding to the X-observable. 



fe::|0)i-H00),|l)i-Hll) 

Sx-- |+}^ l++),hW | — } 

For the remainder of this section, we will define: 

h ■= A £z := k 6 * 



e z ::|0)^l,|l)^l 
e x ::|+)->l,|-)->l 



ex 



=-A 



Up a global e' e factor, the unitary phases for Z are the phase gates Zg and the unitary phases for 
X are the phase gates Xg. We represent these as dots with a phase angle. 



X a := (a 



More generally, we can write arbitrary spiders with phases. 





Since the phase commutes with all of the Frobenius structure, it does not matter which leg of 
the spider we place the phase gate on. For this section, we will ignore (non-zero) scalars, as they 
will not be important for the calculations. Up to scalar factors, the following equations hold. 



Since Z and X are strongly complementary, the phase gates corresponding to classical points 
are Frobenius algebra automorphisms. Using equation | |7.5) and the fact that X a Z n oc Z n X- a for 
all oc, we have: 





There is only one Abelian group of order 2, so by Theorem 7.2.8 X is the group algebra C [ 
defined over the basis given by Z. Both elements of Z2 are self -inverse, so the antipode of the 
strongly complementary pair is trivial. 
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As a consequence, we can freely change the direction of any edge between dots of different 
colours, and we can delete any two parallel edges between dots of different colours. Finally, we 
introduce the Hadamard gate, which exchanges the colours of dots. 



1 



H:-- 



V2 \1 



1 1 



H 




H 




) 


£ 


( 


M 


I 
H 


... 


H 


1 








i 




i 



We refer to the bialgebra identities along with these additional rules as the Z/X calculus. 

7.3.1 Example: Building and Rewriting Circuits 

Consider the following map from C 2 — > C 2 . 






(S) 



-£ri = 



By evaluating the first qubit at |0) and |1), we can see that this map selectively applies X Tl 

Therefore, it is a CNOT gate. From this, we have a universality theorem. 

Theorem 7.3.2. The generators of the Z/X calculus are universal for quantum computation. 

Proof. We have already constructed a CNOT gate, so it suffices to show we can construct an ar- 
bitrary 1-qubit unitary. This is possible because every unitary map C 2 — > C 2 admits an Euler 
decomposition U = Z 7 XpZ a . That is: 



U 



(7.6) 



□ 

Example 7.3.3. It is a basic property of CNOT gates that three alternating applications yields a 
qubit swap: 
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-® 



<9 




We prove this using the Z/X calculus. 







More examples like this can be found in lTT6l . Hillebrand applied to Z/X calculus to a wide 
variety of security protocols in [31J, and Duncan and Perdrix applied it to MBQC in [25J. 
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Chapter 8 

Monoidal Algebra in Quantum 
Entanglement Theory 



We now turn out attention to a different topic in quantum information theory: multipartite entan- 
glement. In this chapter, we review several major results from the study of multipartite quantum 
entanglement. We then give an algebraic (i.e. diagrammatic) characterisation of a special class of 
highly entangled, symmetric states called Frobenius states. Frobenius states always induce com- 
mutative Frobenius algebras, and it can be shown that the two canonical maximally-entangled 
states on qubits, GHZ and W, can be distinguished by a simple property of this induced Frobenius 
algebra: specialness or anti-specialness. 

In studying GHZ and W states abstractly, we introduce the notion of a GW-pair. A GW-pair 
consists of a special commutative Frobenius algebra and an anti-special commutative Frobenius 
algebra, and it exhibits an interaction theory characteristic of the algebras induced by the GHZ and 
W states. We provide a behavioural intuition for the generators of a GW-pair, show that they are 
universal for quantum computing, and use them to encode arithmetic on the complex projective 
line. 

8.1 Classifying Entanglement 

Characterising general N-system entangled states is a very hard open problem in quantum infor- 
mation theory. Before talking about applications of categorical diagrams to the study of entangled 
states, we will briefly give some background and major results from the field. 

Bipartite states, i.e. quantum states consisting of two entangled systems, are fairly well under- 
stood. We can characterise bipartite states by "how much" entanglement they have: ranging from 
product states (which have no entanglement), to perfectly correlated states (which have maximal 
entanglement). 
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</>\/<p 



less entangled <- 



i r (8.i) 

>• more entangled 



This characterisation can be formalised using the majorisation order on bipartite states. This is 
done using the Schmidt decomposition. For any bipartite state | Y) 6 "H <g> "H, there exist orthonormal 
bases {|Uj)} and {|i>;)} and non-negative real numbers a,- such that: 

|Y) = £>; K> ® \ v i) 
i 

The numbers a,- are called the Schmidt coefficients of |Y) and are uniquely determined, up to 
permutation, by |Y). The number of non-zero Schmidt coefficients is called the Schmidt rank. By 
reordering the associated basis vectors, we can always assume these coefficients are in decreasing 
order ocq > a\ > . . . > «d-i- We can define the majorisation ordering on states |Y) , |0) 6 % ® W 
using their associated with Schmidt coefficients {a,} and {/5;}. 



|Y) < M |* 



«* v*. (h°z>hfi 



\ i=0 1=0 / 

Intuitively, states whose Schmidt coefficients are more evenly spread are higher in the majori- 
sation order. For instance, the product state |00) has Schmidt coefficients (1,0), whereas the Bell 
state | Bell ) = -i (|00) + |11)) has Schmidt coefficients (-73,-75)- Since \ < 1 and \ + \ < 1 + 0, 
|00) < M |Bell). 

This relation is transitive, reflexive, and anti-symmetric up to a change of orthonormal bases | «, } 
and \vj). We call two states that are equal up to a change of orthonormal basis on each subsystem 
LU-equivalent, or equivalent up to local unitaries 

Definition 8.1.1. Two states Y, <E> e % <8> • • • <8> H are said to be LU-equivalent if there exist unitary 
maps U{ :H — > H such that: 



/ Y^\ 









X _u 




= 






III 




u N 






i 




1 



So, <m forms a partial order on LU-classes of bipartite states. For qubits, <m is a total order, 
but it is not total in general. Consider two states in C 3 ® C 3 : 



IY) 



-L(^ioo) 



111) 



122: 



10) 



1 

7! 



v / 2|00)+V2|ll) + |22) 



Then, it is neither the case that |Y) <m l^) nor that \^) _^M |Y). However the minimal and 
maximal elements of <m are always unique, up to LU-equivalence. 
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We can get a better feel for what characterises these states by thinking of bipartite states as 
processes, or quantum channels, over which information can flow. We do this by employing the 
principal of channel-state duality. 

Fixing an orthonormal basis {\i)} 6 % fixes a unitary isomorphism % : 7-1* — > % sending (i\ 
to |z). As we have seen in the previous chapter, this is the same as fixing a t-special commutative 
Frobenius algebra. The map % is then the following induced isomorphism H* — W: 



X 




X' 



Up to normalisation factors, we can consider linear maps L : % — > H as bipartite states jT^} G 
L®L and bipartite states |Y) 6 H <8> H as linear maps L Y : H — > H. 




T 



T 




(8.2) 



This is the "pure" version of the Choi-Jamiolkowski isomorphism. The general statement, which 
includes mixed states, is as follows. For a finite-dimensional Hilbert space H, let C{T-L) be the 
vector space of linear maps T-L — >■ T-L. Positive operators correspond to mixed states and completely 
positive maps correspond to (mixed) quantum operations. 

Theorem 8.1.2. Positive operators L 6 £(H <g> H) are in one-to-one correspondence with completely posi- 
tive maps O : £(H) -> £{U). 

Since pure states (up to a global phase) are the same thing as positive operators of the form p : = 
\ip) (ip\ and pure maps L : H — >■ *H are the same thing as CPMs of the form ^(p) = LpL*, equation 
| |8.2) can be thought of as the pure fragment of channel-state duality. Under this correspondence, 
the Schmidt decomposition of a state is essentially the same as the singular value decomposition of 
its associated map. Let (v[\ = X*(\°i))> i- e - m e transposition of \v{) in the basis {\i)}. This clearly 
forms an orthonormal basis for T-L*. Using this basis, we can decompose Ly such that its singular 
values are the same as the Schmidt coefficients of |Y). 

|Y)=J3«,-|M,->«8)|l7 i ) <-> L = Yj K i\ u i)( v 'i\ 

i i 

In particular, the Schmidt rank of a bipartite state is the same as the rank of the associated map. 

Quantum teleportation is an archetypal example of regarding a bipartite state as a channel. 
The entangled state that Alice and Bob share provides the "medium" over which the unknown 
state is teleported from Alice to Bob. Suppose we considered variants of the teleportation protocol 
over an arbitrary finite-dimensional Hilbert space T-L, replacing the (perfectly correlated) Bell state 
with other kinds of bipartite states from chart | |8.1} . The states at the left extreme are the worst for 
teleportation. Regarded as channels, product states correspond to rank-one maps. No matter what 
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we put in to such a channel, we always get the same output, up to a scalar. Such channels cannot 
be used to send any quantum data. At the other extreme are the perfectly correlated states, which 
correspond to unitary maps under channel-state duality. Any such state can be used to construct 
a teleportation protocol which will always succeed. If this unitary LTy is not the identity (as in the 
case of the Bell state), we simply need to undo it by applying U^ to one of the sub-systems. 

In between these two extremes are maps L of rank 2 < r < D, which can be thought of as 
noisy channels. Suppose r = D, then \Yi) can at least in principal teleport a state, but it might 
be impossible for Bob to recover Alice's state deterministically However, in most cases, Bob can 
at least recover the state with non-zero probability. This is because an arbitrary linear map can be 
"applied" to a quantum system by first applying a big unitary to the system and an ancilla state, 
then measuring the ancilla. 



T 



u 



= 

If Bob gets outcome 0, then he has successfully applied L, otherwise he has applied some other 
(unwanted) map L' . Maps L that can be applied with non-zero probability are known as stochastic 
maps. In the case where r < D, states can only be teleported in a lossy sense, i.e. they are (non- 
deterministically) projected on to a subspace of W before being sent. Under channel-state duality, 
chart dS.lfc becomes: 



less entangled <- 








6 




U 



6 



-»• more entangled 



When we move from bipartite entanglement to multipartite entanglement, the picture becomes 
less clear. For one thing, this is no canonical analogue to the majorisation order involving three or 
more systems. However, we can define an operational ordering on states that is equivalent to the 
majorisation ordering in the bipartite case. Operational orderings relate states by the existence of 
certain types of quantum protocols that can convert one state into another. The most well-known 
type of protocol used for this purpose is a LOCC protocol. 

Definition 8.1.3. An N-partite state |Y) can be converted into \<S>) by Local Operations and Classical 
Communication (LOCC) if there exists an N-party protocol that can deterministically convert |Y) into 
|<&) consisting of any number of the following operations: 
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1. Party p, performs a local measurement of the z'-th subsystem (possibly with an ancillary sys- 
tem which only p, can access). 

2. Party p, performs a local unitary which can be conditioned on any previous measurement 
outcome in the protocol. 

Insuchacase, we write |<E>) <locc |Y). If two states are LOCC-interconvertible (i.e. \<£>) <locc 
|Y) and |Y) <locc \®)), we say they are LOCC-equivalent, written |Y) ~locc l^}- 

Nielsen and Vidal showed that the majorisation order is the same as the LOCC order on bipartite 
states [55 j. As a consequence, two states are LOCC-equivalent if an only if they are LU-equivalent. 
In fact, this relationship between LU-equivalence and LOCC-equivalence is true for any number of 
systems. 

Theorem 8.1.4 ([9]). Two N-partite quantum states are LOCC-equivalent if and only if they are LU- 
equivalent. 

Example 8.1.5. The following two bipartite states are LOCC-equivalent: 

\Bett) := ^ (|00) + |11» \EPR) := ^ (|01) + |10)) 

Even though there are infinitely many LOCC classes in the bipartite case, the majorisation or- 
der gives us a straightforward way to characterise LOCC-equivalence, as well as those states that 
are minimal and maximal with respect to <locc- However, for three or more systems, the picture 
becomes much more complicated. For that reason, it is convenient to introduce a course-graining 
of LOCC called Stochastic LOCC, or SLOCC. For SLOCC, we relax the requirement that the LOCC 
protocol succeeds deterministically and merely require that it succeed with some non-zero proba- 
bility. 

Definition 8.1.6. An N-partite state |Y) can be converted into |<E>) by Stochastic Local Operations and 
Classical Communication (SLOCC) if there exists an N-party LOCC protocol that converts |Y) into 
|<J>) with a non-zero probability. 

Again, we use <slocc an d ~slocc to represent SLOCC-convertibility and SLOCC-equivalence, 
respectively. It was shown by Diir, Vidal, and Cirac that SLOCC-equivalence can be characterised 
in a similar manner to LOCC-equivalence, but replacing unitary maps with invertible maps. 

Definition 8.1.7. Two states |Y) , |<E>) 6 1-L <g> . . . <g> T-L are said to be ILO-equivalent if there exist 
invertible local operations L; : V. — > H such that: 



/ Y X^ 




O 




JL _U 




= 






Li 




Ljv 






1 




i 
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Theorem 8.1.8 (HO). Two states |Y),|<E>) G U 
equivalent. 



% are SLOCC-equivalent iff they are ILO- 



We say a state |Y) is SLOCC-maximal if its SLOCC-equivalence class is maximal with respect to 
^SLOCO i- e - 1^} <SLOCC 1^) == ^' 1^) ~SLOCC 1^}- We define SLOCC-minimal states similarly. 
For any number of systems, the unique minimal SLOCC-equivalence class is the class of product 
states 1 00 . . . 0). For C 2 ® C 2 , the only other SLOCC class is generated by the perfectly correlated 
states. So, the Hasse diagram of SLOCC classes has only two elements. 



\Bell) 



< 



SLOCC 



For three qubits, there are still only six SLOCC classes (or four classes, up to permutations of 
qubits), there is no longer a unique maximal SLOCC-class. The Hasse diagram of the tripartite 
SLOCC classes is: 




Where \GHZ) = ^= (|000) + |111)), \W) = \ (|100) + |010) + |001}), and the \D { ) states rep- 



resent the three separable configurations of a Bell state with |0) . 



IDi 




|D 2 > 



\D 3 ) 




The complete classification for three qubits was given by Diir, Vidal, and Cirac [ 26 1 . They distin- 
guished GHZ and W by using an entanglement measure called the 3-tangle, or residual tangle. Intu- 
itively, this is the entanglement "left over" after bipartite correlations are subtracted out. States that 
are SLOCC-equivalent to \GHZ } have a non-zero 3-tangle. These states have some true 3-body en- 
tanglement, even accounting for bipartite correlations. However, states that are SLOCC-equivalent 
to | W } always have a vanishing 3-tangle, which means that all the entanglement present can be 
accounted for by bipartite correlations. Informally, we might depict the correlations in GHZ and W 
as: 
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GHZ: 



W: 




One could say that W has no "true" tripartite entanglement. One could also state this property 
positively by stating that W-like entanglement, unlike GHZ-like entanglement, is (partially) robust 
to the loss of a one system. That is, if we trace out one of the three parties, the reduced density 
matrix of W becomes (ignoring normalisation): 

p' w = (|10) + |01))((10| + (01|) + |00)(00| = V2 \EPR)(EPR\ + |00)(00| 

So, p' w corresponds to the probabilistic mixture of the (entangled) Einstein-Podolsky-Rosen state 
\EPR } and a separable pure state |00). However, if we do the same to the GHZ state, we get: 

Pghz=|00)<00| + |11><11| 

This is a mixture of two product states. Therefore it contains no entanglement if the third system 
is disregarded. 

There are various schemes for classifying multipartite entanglement which have had some suc- 
cess for small (typically < 6) numbers of systems. These approaches have had mixed results, as the 
general problem becomes very difficult for four or more systems. For four or more systems, there 
is necessarily an infinite number of SLOCC classes |26J. This is because the continuous degrees 
of freedom in the state space of N systems (2 N — 2) quickly overcomes the degrees of freedom 
available in a tensor product L\ . . . <8> Ljv of local invertible maps (6N + 2). Nevertheless, one 
sometimes give a finite set of SLOCC super-classes, i.e. SLOCC classes with some free parameters, 
which span the state spaces of these larger systems. However, there is no one right choice for a 
parametrisation, so these super-classes are not uniquely determined and often reflect the methods 
that were used to obtain them. 

Lamata et al introduced an inductive scheme for classifying multipartite states 114611 for N partite 
states in terms of the classification of N — 1 partite states. It works by treating N partite states as 
maps from H to ft®^" 1 ). 







./Y^v 


Ly 








^J 


i- 


•i 





N-l N-l 

They then look at the image V C t^sh^- 1 ) f £ Y and classify the vectors that span V (i.e. 
N — 1 partite states). Using their classification scheme, |GHZ ) is the unique, tripartite qubit state 
whose image is spanned by two product states { | ip\ ) <g> | <p\ ) , \ fa) 8> | ^2) } where | ip\ ) 7^ A | ip 2 ) ) and 
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\(p\) 7^ A' I <p2). \W) is the unique state whose image is two dimensional, but only contains one 
product state. The whole classification for tripartite qubit states is as follows. 



State 


V spanned by 


\GHZ) 


{|?l)®|fr),|?2>®|fc>} 


\W) 


{|t/>) <g> \<p), |Y)} 


\Di) 


M®\<i>i),m®\<p2)} 


\Di) 


{hfr>®l*),hfc)®l*>} 


|D 3 > 


(|Y)} 


|000) 


(|t/>)® !</>)} 



Since it can be proved that any two-dimensional subspace of C 2 <g> C 2 contains at least one 
product state, this list is exhaustive, and matches the classification given by [26]. It also reflects 
the fact that tracing out a qubit from | GHZ } yields a mixture of product states, while tracing out a 
qubit of | W) yields a mixture of a product state and an entangled state. 

In 1 47], the authors went on to use this technique to enumerate a finite set of 4-partite SLOCC 
super-classes spanning (C 2 )® 4 . However, the amount of calculations to ensure these super-classes 
were non-overlapping and spanned the whole space was much greater than in the tripartite case, 
and it seems likely that enumeration of super-classes with more than four subsystems would be 
significantly more difficult than the 4-partite case. 

8.1.1 Symmetric States 

The study of symmetric multipartite states is quite a bit simpler, because they have significantly 
fewer degrees of freedom. For instance, the space of N-partite symmetric states over qubits is 
spanned by the N-partite Dicke states: 



D 



(*)\ 



N / 



N-k k 

Where Sn is the N-system symmetrisation map. 

I 
S N :: |zi,z 2 ,..., i N ) H- — JZ 

7rePerms(N) 

So, for qubits, the space of N-system symmetric states is (N + 1) -dimensional. It can also be 
shown that an arbitrary N-partite state is the symmetrisation of a product state. 



l Tz(l)' l n{2)'- ■■' l n(N) 



IT) 



<Sn(|V1'V'2/---/</'n)) 



The factors \ipi) need not be distinct. Since the value of |Y) does not depend on the order of 
factors, |Y) is totally defined by the multiset of M < N distinct factors | <£>,): 

|Y) = <Sn( |^1/ • • -,<pl\ ® 1^2/- • -,<pz) ® • • • ® \<t>M,-- -,<pM.)) 

d 1 d 2 d M 
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We can also assume that the d{ are in decreasing order. The number M of distinct factors is called 
the diversity degree of a symmetric state, and the list X>y = [d\,d2, ■ ■ ■ ,^m] i s called its degeneracy 
configuration. 

Definition 8.1.9. Two N-partite states |Y) , |<1>) 6 1-[® N are called symmetrically SLOCC-equivalent if 
there exists a single invertible map L : H — >• H such that: 




L - L 



Bastin et al provided a classification result for symmetric qubit states of any size with a small 
diversity degree [8j. It relies crucially on the following fact about qubit states. 

Proposition 8.1.10. Two N-partite symmetric states are SLOCC-equivalent if and only if they are symmet- 
rically SLOCC-equivalent. 

Using this fact, they showed that for small diversity degree, SLOCC-classes are uniquely fixed 
by their degeneracy configuration. 

Theorem 8.1.11 ([8]). Two N-partite symmetric qubit states with diversity degree M < 3 are SLOCC- 
equivalent if and if only they have the same degeneracy configuration. 

They proved that any state with diversity degree M = N is SLOCC-equivalent to \GHZ-^) = 
|0...0) + |l...l). As a consequence, the SLOCC classes of N-partite symmetric states are com- 
pletely determined for N < 4. They also showed that for 4 < M < N, there are necessarily 
infinitely-many SLOCC classes of symmetric states. 

8.2 Strong SLOCC-maximality and strong symmetry 

In an effort to push state classification farther, we will identify the properties of that make GHZ 
and W states unique. In order to leverage techniques from categorical quantum mechanics and 
diagrammatic languages, we shall focus on properties that are compositional in nature. That is, we 
shall look at how GHZ and W states, treated as maps via channel-state duality, behave when they 
are composed with other states or themselves. 

Whereas bipartite states can be thought of as quantum channels, tripartite states can be thought 
of as algebraic operations which have two channels of input and one channel of output (or, equiva- 
lently, as coalgebraic operations from 1 input to 2 outputs). 
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The GHZ and W states are characterised as the unique tripartite qubit states that are SLOCC- 
maximal, so we shall look at how to characterise SLOCC-maximality abstractly. For bipartite states, 
this condition is equivalent to the a bipartite state forming the "cap" of a self-dual compact struc- 
ture. 

Proposition 8.2.1. A bipartite state |Y) 6 %&% is SLOCC-maximal iff there exists an effect (<J>| : 

H®l-L^€ such that 

' 'f 

(8.3) 




Proof. We first show that SLOCC-maximal maps have full Schmidt rank. Suppose |Y) has Schmidt 
decomposition: 

W = E a *' I"') ® \ V i) 
i 

If for any i, a, = 0, then there exists a map S with |w ( ) in its null space such that |Y) = (S ® 
1) |Y'). Since S is singular, it corresponds to a non-reversible stochastic local operation, so |Y) is 
not SLOCC-maximal. 

Since any SLOCC-maximal state must have full Schmidt rank, we can construct (<I>| as follows. 

(*l=£r>tl®("*l 



Then equation 1 8.3 1 is satisfied. Conversely, if |Y) were not SLOCC maximal, then |Y) = (S < 



1) |Y') for some |Y') and some singular map S. Clearly no bipartite state of this form could satisfy 
equation ||8.3). D 



Characterising SLOCC-maximal tripartite states is trickier. However, GHZ and W satisfy a 
stronger version of SLOCC-maximality. As in |46|, we can study tripartite entangled states by 
studying the states that span the image of the associated map <%. 




Span 




Span 




For convenience, we define the 3 bipartite image spaces of | Y) as follows. 

I mi (|Y)) = Span{ ((i\ (g)l <g>l) |Y) } 
Im 2 (|Y)) = Span { (1 <g> (i\ (8) 1) |Y) } 
Im 3 (|Y}) = Span{ (1 ® 1 ® (i\) |Y) } 

For states in (C D )® 3 to be SLOCC-maximal, these spaces must all be D-dimensional. In fact, 
this condition is equivalent to SLOCC-maximality. 
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Proposition 8.2.2. A tripartite state |Y) is SLOCC-maximal if and only if M,'(|Y}) is D-dimensional for 
i = 1,2,3. 

Proof. Suppose that Im.i(|Y)) is not D-dimensional. Then, there must exist (ip\ 6 C D such that 
((xp\ (g> 1 (3 1) |Y) = 0. Fix an orthonormal basis («,| spanning (ip| . Then define a singular map 
S = Li \ui) (uj\, and note that there exists |Y') such that (S <g> 1 <g> 1) |Y') = |Y), so |Y) cannot be 
SLOCC-maximal. The argument follows similarly for Im2(|Y)) and Im3(|Y)). 

Conversely, let |Y) not be SLOCC-maximal. Then it must be in one of the following forms: 

(S <g> 1 <g> 1) |Y') (l<g>S<g>l)|Y') (1 <g> 1 <g> S) |Y') 

In each of these cases, one of the spaces Im, (|Y) ) must be less than D-dimensional. □ 

It is always the case for SLOCC-maximal tripartite states that each of the spaces Im,- (|Y)) contain 
entangled states. However, for D > 2, it need not contain a SLOCC-maximal bipartite state. The 
case of D = 2 is degenerate in the sense that any entangled state is SLOCC-maximal. 

Definition 8.2.3. A tripartite state |Y) is called strongly SLOCC-maximal if each of its associated 
image spaces Im,(|Y}) contain a SLOCC-maximal bipartite state. 

Theorem 8.2.4. Strong SLOCC-maximality implies SLOCC-maximality. 

Proof. It suffices to show that the image spaces Im,(|Y}) are all D-dimensional. Let (i/>| be a state 
such that |<3>) = (1 (g) (tp\ (g> 1) |Y) is a SLOCC-maximal bipartite state. Then, fixing an orthonormal 
basis (z| for (C D )*, all states of the form 



77 \1v 
must be linearly independent for distinct values of i, so they span C D . Thus, non-projected states: 

Y 




must span a D-dimensional subspace of C D (g> C D . The result for Im2(|Y)) and InvjdY)) follows 
similarly. □ 

Remark 8.2.5. In the case of D = 2, any entangled bipartite state is SLOCC-maximal, so strong 
SLOCC-maximality and "weak" SLOCC-maximality are equivalent. However, for D > 2, the im- 



plication in Theorem 8.2.4 is strict. To see this, consider the following state in C 3 <g> C 3 (g> C 3 . 

|Y) = |000) + |101) + |110) + |202) + |220) 
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The spaces Imi(|Y)), Im2(|Y)), and Irri3(|Y)) are all 3-dimensional, so |Y) is SLOCC-maximal. 
Every state in Im, ( | Y) ) is of the form: 



IO) 



a |00) + b(|01) + |10)) + c(|02) + |20)) 



But |Y) cannot be SLOCC-maximal. If b — c — 0, then it is a product state, otherwise there 
exists a non-zero bra (£| :— c (1| — b (2\ such that ((£ | <8> 1) \&) = 0. 

When a state is symmetric we can simplify the strong maximality condition. A state |Y) is 
strongly SLOCC-maximal if there exist effects (f|, (<E>| such that: 



(8.4) 



The GHZ and W states are strongly SLOCC-maximal because they are SLOCC-maximal qubit 
states. In particular, ignoring scalar factors, the following equations hold: 






(8.5) 



The GHZ and W states have natural N-qubit symmetric analogues. 

|GHZ N ) := |00...0) + |ll...l) 

\W N ) := |10 . . . 0) + |010 . . . 0) + . . . + |0 . . . 01) 

Not only do they have such N-partite versions, they come with a recipe for inductively con- 
structing them. That is, for both of these states, there is a bipartite effect (<3>| that can be used 
to "glue" a tripartite state on to an N-partite state by projecting out a pair of qubits to make an 
(N + 1) -partite symmetric state. 

\GHZ N+1 ) = (1(E) (Bell \®1)(\GHZ N )(E)\GHZ)) 
|W N+1 ) = (1<8><EPJR|®1)(|W N )®|W» 

To inductively build a symmetric N-partite state, it suffices that the following condition hold. 

Definition 8.2.6. A symmetric state is said to be strongly symmetric if there exists some bipartite 
effect (<E>| such that 



(8.6) 
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8.3 Frobenius States and their Induced Frobenius Algebras 

A state that is strongly SLOCC-maximal and strongly symmetric is called a Frobenius state. 

Definition 8.3.1. A symmetric tripartite state |Y) € 'H®'H®'His said to be a Frobenius state if there 
exist effects (<E>|, (£| such that: 



(8.7) 



Note that (<3>| satisfying two equations in the definition must be the same effect. This is a stronger 
condition than stating that a Frobenius state is both strongly SLOCC-maximal and strongly sym- 
metric (i.e. these equations respectively hold for some possibly distinct effects (<3>| and (3>'|). 

Theorem 8.3.2 (Algebras as states). For any commutative Frobenius algebra ( TJr , ^ , Jk , 4 )/ the fol- 
lowing is a Frobenius state, with its two associated effects: 







V 



-i 



Proof. The induced triparite state is symmetric because Jk is co-commutative. The two Frobenius 
state equations follow from the spider theorem of commutative Frobenius algebras. □ 



The Frobenius state conditions from Definition 8.3.1 hold as a consequence of Theorem 3.2.20 



Also, from any Frobenius state we can construct the associated commutative Frobenius algebra. 

Theorem 8.3.3 (States as algebras). For any Frobenius state | Y), there exist effects (<E>|, (£| such that the 
following is a commutative Frobenius algebra: 



Y- 





! 



I 



Proof. Cocommutativity of Jk follows from |Y) being symmetric. Coassociativity follows from 
the symmetry of | Y) and the strong symmetry equation. 
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Associativity and the Frobenius law follow similarly. Strong SLOCC-maximality implies that 
(£| is a counit for J^ and that ((£ | (g) (f| <g> 1) |Y) is a unit for V . □ 

We can therefore define a commutative Frobenius algebra using either the usual maps (pi, rj, 5, e), 
or a triple (|Y) , (<E>| , (£|) consisting of a Frobenius state and its two associated effects. Also, note 
that for a given state |Y), there could be multiple induced commutative Frobenius algebras based 
upon the choice of (£|. However, once (£| is fixed, (<&| is completely determined. This is analogous 
to the situation with Frobenius algebras where the maps u and e completely determine the other 
two. 

Theorem 8.3.4. If (|Y) , (<3?| , (£|) is a Frobenius state and |Y'} fs symmetrically SLOCC-equivalent to 
|Y), f/ien |Y') extends to a Frobenius state in such a way that the induced commutative Frobenius algebras 
o/|Y) and |Y') are isomorphic. 

Proof. Let |Y') = (L ® L (g) L) |Y) for some invertible map L. Then let (£'| = (£| L^ 1 and (<J>'| = 
($|(L _ (8L). It is straightforward to verify the Frobenius state axioms, and constructing the 
CFA as in Theorem |8.3.3| we have: 



JL, ^L 




JL, 




L- 1 L- 1 

Y 


T 

L 
1 


L- 1 

A 


I 
L- 1 

4 


L 




L L 





8.3.1 Classification of Qubit Frobenius States 



a 



In section 3.2.2 we defined special and anti-special commutative Frobenius algebras. These are 



both defined as CFAs with one additional axiom. 
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SCFA „ ^ AGFA Q 



c i 



We show in this section that GHZ and W, the two canonical SLOCC-maximal tripartite qubit 
states, are both Frobenius states. Furthermore, the specialness and anti-specialness condition serve 
to distinguish the (symmetric) SLOCC-classes of these two states. 

It will be convenient to work with the unnormalised versions of | GHZ } and | W ) . 

| GHZ) = |000) + |111) \W) = |100) + |010) + |001) 

For the Frobenius state |GHZ), fixing (£| := (0| + (1| induces the following CFA, which we 
shall refer to as Q. 



(8.8) 



= |0)(00| + |1)(11| $ = V2 |+) = |0) + |1) 

^ = |00)<0| + |11)<1| 6 = ^2(+l = (0| + (l| 

We can verify that Q is special. 



Y°A = io><oi 



I1W1I = 1 



C2 



(8.9) 



For the Frobenius state |W), fixing (£| := (0| induces the following CFA, called W. 

Y = |l) (11| + |0) (01| + |0) (10| f = |l) 

X = |00)(0| + |01)(l| + |10)(l| i = (0| 

Computing the partial traces of multiplication and comultiplication, we get: 

^ = Tr c2 (X) = E(l ® <»|)C|00> (0| + |01) (1| + 1 10) (1|) \i) = 2 |0) 

i 

i = Tr c2 ( Y) = J2(i\ (|1) (11| + |0) (01| + |0) (10|)(1 ® |i» = 2 (1| 
i 

We can then verify that W is anti-special. 

o® ( Y ° A ) = 2 (i°) (!i + 1°) ( a i) = ( 2 1°))( 2 ai) = ¥ ° i 

The next theorem is a straightforward consequence of the classification of tripartite qubit states 
and Theorem |8.3.4| 

Theorem 8.3.5. Any SLOCC-maximal, symmetric state |Y) 6 C 2 ® C 2 ® C 2 is a Frobenius state. It is 
SLOCC-equivalent to |GHZ) (resp. \W)) if and only if the associated commutative Frobenius algebra is 
special (resp. anti-special). 
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Proof. We first show |Y) is a Frobenius state. Since it is SLOCC-maximal, it must be SLOCC- 
equivalent to GHZ or W. Since it is a symmetric qubit state, it must be symmetrically SLOCC- 



equivalent to GHZ or W by Proposition 8.1.10 Therefore it is a Frobenius state. 



If |Y) is SLOCC-equivalent to GHZ, its induced CFA is isomorphic to Q by Theorem 8.3.4 



so 



it must be special. If |Y) is SLOCC-equivalent to GHZ, its induced CFA is isomorphic to W, so it 
must be anti-special. 

Conversely, let |Y) be the induced Frobenius state of a special commutative Frobenius algebra 
S. It must be SLOCC-equivalent to GHZ or W. Suppose it were to be SLOCC-equivalent to W, 
then S is isomorphic to an anti-special commutative Frobenius algebra, which is a contradiction 
for dimensions > 1. So, |Y) must be SLOCC-equivalent to GHZ. The result follows similarly for 
ACFAs. □ 

8.3.2 Classification of Frobenius States for Higher Dimensions 

We can push this classification a bit farther, but less is known in higher dimensions. 
Example 8.3.6. We can produce D-dimensional analogues of the GHZ state and the W state. 

GHZ( D )\=£|m) 

i 

W< D >) = 1 100} + 1 010) + 1 001) + £(|0zz) + I z"0z> + |«0)) 

Theorem 8.3.7. 

GHZ( D ' \ is always special and the induced CFA for W^ D ' \ is always anti-special 



GHZ( D > \ and W^ D > \ are both Frobenius states for any dimension. The induced CFA for 



Proof. For GHZ* ) V the associated effect in (C D )* is £,- (i\ and for w( D '\, it is (0|. The rest of 
the structure is uniquely determined, and the verification of SCFA and ACFA axioms is straightfor- 
ward. □ 



Theorem 8.3.8. For any D-dimensional SCFA, the induced Frobenius state is SLOCC-equivalent to 



GHZ^ 



Proof. Let (V., u, n, 5, e) be an SCFA. Then so too is (H*, 6*, e* , u* , n*). Any special Frobenius alge- 
bra over C is semisimple, and any commutative semisimple algebra over C is isomorphic to the 
direct sum of D copies of (C, ■ ) . So, for a (not necessarily orthonormal) basis { |«;) }, ]i has the form: 

6 :: \ui) i— >• \ui,ui) 

For some arbitrary n = £j «; |Mj), the tripartite state is: 

|Y) = {\® 5) o 5 on = Y\(X-i \iii,Ui,Ui) 



152 



For |Y) to be SLOCC maximal, all of the scalars a, must be non-zero. Let L be defined as: 



1 

L :: \ui) h- > — «,-) 
a,- 



Then, applying L to any of the subsystems yields 

1 

which is clearly SLOCC-equivalent to GHZ( D )\. D 

A complete classification of ACFAs is not yet complete. For dimensions D < 6, there are rela- 
tively few commutative algebras, up to isomorphism, so it is feasible to enumerate them and check 
which extend to ACFAs. For D < 4, there is only one ACFA, and its Frobenius state is SLOCC- 
equivalent to W' ^ V However, for D = 4, there are already two non-isomorphic ACFAs, so the 
classification of these types of Frobenius states may be more difficult in generally] A classification 
of all Frobenius states for D < 6 is in progress, and will be included in a forthcoming sequel to [19J. 

8.4 A Graphical Theory for Entanglement 

We now look at some of the attributes of the pair of commutative Frobenius algebras corresponding 
to the GHZ and W states. Q = ( xf , 9/ P. , 6 ) is a t-special commutative Frobenius algebra, so 
it has an orthonormal basis of classical points. For W = ( IF , ^ , Jk , <j> ), note that y defines a 
partial monoid over the classical points of Q, and jk a partial monoid over their adjoints. In other 
words, \ikj) and (ifj\ defined as follows must either be classical points for Q or 0. 





To handle the case where ikj = _L or if] — _L, we use the following notation for "undefined 
points": 

m = o 



T 

Since W is not a t-CFA, it is not necessarily true that (|z'A/)) = (iVj\. We will shortly see that 
these two partial monoids are isomorphic, but they are never identical for non- trivial ACFAs. The 
other thing to note is that both the unit and the anti-unit are proportional to classical points of Q. 



T 



? = o 



1 Thanks to Alex Merry for performing these calculations. 
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We can build up these features abstractly. First, we generalise several results from section 7.2 
to the case where one of the Frobenius algebras is not t-special. The next definition provides a 
condition for CFA to form a partial monoid over an orthonormal basis. 

Definition 8.4.1. An arbitrary commutative Frobenius algebra ( T» , ^ , Jk , 4 ) is said to be closed 
on a t-special commutative Frobenius algebra ( XjJ , ^ , fX , 6 ) if the following equations hold: 



A = h Y = n 



For the rest of this section, let ( 9f , ^ , Jk , 4 ) be a commutative Frobenius algebra closed on 
at-SCFA(y,9,^,6). 

Theorem 8.4.2. For a commutative Frobenius algebra ( ^f , f, Jk, 4), closed on ( y , 9/ A / 6)' ^ e 
dualiser is unitary and self-adjoint. Furthermore, it is a permutation of the classical points of 



Proof. We can compute caps and cups in terms of ▲ and T. It follows from Definition 8.4.1 that 
^ = \u) is a classical point for <*> and 4 — ( c | is the adjoint of a classical point. So: 



X= e iw') y = e 



(h]\ 



From these, we can compute the dualiser and its inverse. 



d = (\J- E m\ 




E 1001 

ik.)=c 



Written in the basis given by p, , these are both binary matrices (i.e. matrices whose entries 
are all either or 1). But then, the only binary matrices whose inverses are also binary matrices are 
the permutations, so d must be a permutation, and d _1 = d + its inverse permutation. The fact that 
this map is self-adjoint follows from the symmetry of caps and cups. Evaluating the cup at classical 
points, we have: 





□ 



For the remainder of this chapter, we will represent the dualiser by placing a tick on an edge. 
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The dualiser generates a monoid isomorphism from ▲ to T : 



X\ 





^n 



T 



(8.10) 



Definition 8.4.3. LetS= ( \f , 9, J^ r &) be a t-SCFA and let ^ = ( V, f, ^, 4) beanACFA. 
Then (S, A) is called a GW-pair if „4 is closed on 5 and the following equations hold: 



o 



?? 



o 



u 



Technically, we only have to require that V be proportional to a classical point. Then, the fact 
that 4 o Ty = O) = D uniquely fixes the scalar factor. Furthermore, we can prove that the dualiser 
interchanges the unit and anti-unit. 



Lemma 8.4.4. The following equations hold for any GW-pair: 

0(?) + = i o(4) + = ¥ 

Proof By Definition < 



0? = ? 



8.4.3 



TJf and JL correspond to classical points for JX , up to scalar. By closure, 
J and 4 ar e also classical points. Since 4 ° y — (3 7^ 0/ then ( 4 ) and V must be proportional. 
By closure, 4 is equal to a classical point, hence it is normalised. So (~J ( ^ ) — A ■ The second 
equation holds similarly. We can also show that * = ( 4 ) : 



?-(\J-(°> 



Since 4 is the adjoint of a classical point, ( 4 ) 



4 ) = ?. The final equation then follows. 

a 



A consequence of this lemma is that the monoids ▲ and T cannot be equal for dimensions 
D > 2. By anti-specialness and equation ( 8. 10} , that would imply the identity was rank 1. In 1191 , 
Coecke and Kissinger identified four axioms for interacting GHZ- and W-like states. 



(i.) 





(ii.) 



(iii.) A = H 

Theorem 8.4.5. Any GW-pair satisfies axioms (i.)-(iv.). 



(iv.) 



I 
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Proof. Axioms (i.) and (ii.) are consequences of Theorem 8.4.2 Axiom (iii.) is by definition, and 
axiom (iv.) is part of Lemma 8.4.4 □ 



It is known that not all SCFA/ACFA pairs in Rel satisfy the closure identities, so it is likely that 
the GW-pair conditions are strictly stronger than axioms (i.)-(iv.). However, the Frobenius states 
GHZA ' ) and W' ' ) provide at least one example of a GW-pair for any dimension in FHilb. 



8.4.1 Symmetric Modules of an SCFA and Distributivity 

For any Hilbert space Ti, we can form the space Ti ®s "H- or symmetric vectors in Ti <8> Ti. There is a 
canonical projection from an arbitrary vector in Ti ® Ti onto a symmetric vector in Ti ®$ Ti, called 

the symmetriser map S^ 




Any monoid {T-L, \i, r\) can be extended to a monoid {Ti ®s H> V-Si Y ls) by using the symmetriser. 








There is a canonical /ig -module over the whole space Ti<g)Ti. It is essentially the regular module 
of jig/ but without the symmetriser maps on the right. 




We shall refer to this as the extended regular module Xp For any SCFA, there is also a ^fg-module 
k/ u .\ over Ti for every classical point |u,-). 




For a GW-pair, (c| := i is the adjoint of a classical point, so it induces a /ig-module on Ti. We 
call a GW-pair distributive if the multiplication tf : Ti ® Ti — >■ Ti is a ^g-module homomorphism 

£rom(ft<8)'H,5c f j)to(ft,fc<c|)- 

Definition 8.4.6. A special commutative Frobenius algebra ( y , ^ , £>, ^) and an anti-special 

commutative Frobenius algebra ( TF , f , Jk , 4 ) form a distributive GW-pair if they are a GW-pair 

and: 



(8.11) 
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We refer to this condition as distributivity, because it resembles the distributive law for rings. To 
see this most clearly, consider arbitrary vectors \a) , \b) , \c) 6 M.. Noting that \a) <g> \a) is symmetric, 
we can prove the following equation: 



(8.12) 



Ignoring scalars, and writing "+" for ff and "■" for XJ , this equation becomes: 

a- (b + c) = (a-b) + (a-c) 

It reduces to the distributive law familiar from arithmetic. A slightly different, but equivalent 
way to see this is 9F copies phases for y , up to a scalar. 







This is similar to the case for strongly complementary observables, as in equation | |7.5) from 
However, the condition here is much stronger, because TF copies arbitrary phases for 



section 



7.2 



TT , rather than just those corresponding to classical points. 

Example 8.4.7. The Frobenius algebras Q, W defined in section [8.3.1 form a distributive GW-pair. 

The distributive law implies many identities between W and XJ . In the next lemma are two 
equations that we shall find useful in the next section. 

Lemma 8.4.8. For any distributive GW-pair, the following equations hold: 



t 
1 



Proof. The first equation follows from distributivity, noting that ^\ is symmetric. 

.XX O 






i 
T 



i 
T 
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The second equation then follows from the first one. 





a 



8.4.2 Universality 

Returning to the example of the GHZ and W Frobenius algebras, we treat ( XJ , 9, A, 6) and 
( T>r , y , Jk , 4 ) as generators for quantum circuits and look at their computational power. While 
they are not gates themselves, as they are not unitary, we shall soon see that we can think of certain 



gates as begin composed from these generators, as we did for the Z/X calculus in section 7.3.1 



Alternatively, one can think of the GW-pair maps as stochastic gates, prepared using post-selection 
or some more sophisticated measurement-based scheme. 

First, note that the dualiser interchanges ^ = |1) and ? = |0), so it serves as a NOT (i.e. Pauli 
X) gate. 

While the GHZ dot copies both f and ^, the W dot acts like a "controlled" copy. 



?? 



TT 



A= f? A-* 



With this behaviour in mind, we can use these generators to build a CNOT gate. 



< 



-e 



We can verify that this is indeed a CNOT gate by rewriting. 



Q-H«C ■- 



rrrrt-i 






4M 



Theorem 8.4.9. T/ze generators of Q and W, a/ong- with single-qubit states, are universal for quantum 
computation. 
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Proof. We have already illustrated the construction of a CNOT gate. To complete the proof, it suf- 
fices to show that we can apply arbitrary single-qubit unitaries. We can actually do better than this 
by showing we can apply arbitrary single-qubit linear maps. We can write a general single-qubit 
diagonal matrix as a GHZ phase. Let \ipi) = a |0) + b |1). 




«|0><0|+b|l>(l| = (p f) (8.13) 




v b 

For \ip2) = c |0) + |1), we can construct an arbitrary unit-diagonal upper triangular matrix as a 
W phase. 

Ap\ 

|0)<0|+ C |0><1| + |1)<1| = (J J) (8.14) 

For |!/?3) = d |0) + |1), we can construct an arbitrary unit-diagonal lower triangular matrix by 
applying the dualiser. 

|0)<0|+d|l)<0| + |l)<l| = (J J) (8.15) 

Any linear map decomposes as M = PLDU, where P is a permutation, L is a unit-diagonal 
lower triangular matrix, D is diagonal, and U is a unit-diagonal upper triangular matrix. Since the 
only permutations on C 2 are the identity and 4;, we can construct any single-qubit map using 4^ 
and the maps above. D 

8.4.3 Arithmetic on the Complex Projective Line 

It is a well-known fact that points on the Bloch sphere correspond to points on the complex projec- 
tive line. Any state a |0) + b |1) where b 7^= can be represented, up to a scalar by the quotient |. 
Defining \ b/a ) := |0) + | |1) and |oo) := |1), we can cover the entire Bloch sphere. Then, the usual 
projection of a sphere on to CP takes these states to their corresponding points in CP . 
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Under this correspondence, we can see that the algebra induced by the W state is addition 
on CP and the GHZ algebra is multiplication. Before we illustrate this, we define addition and 
multiplication on CP as commutative partial monoids. For k\,k 2 6 C, addition and multiplication 
are defined as usual. For the rest, let A: 6 C be a non-zero complex number, and let _L represent 
undefined. 



k ■ oo = oo 
O-oo = _L 



k + oo =oo 
+ oo = oo 



(8.16) 



00-00 = 00 oo + oo = _l_ 

Intuitively, these are addition and multiplication operations for "formal fractions" over C. That 
is, equivalence classes of pairs of complex numbers: 

\(d,n)\ = {(d,n) ~ (Ad,An) : A e C - {0} } 

Letting oo := (0, 1) and _L := (0,0), we can reproduce the above multiplication tables with: 

|(di,ni)| + \{d 2 , n 2 )\ = \{did 2 ,nid 2 + n 2 di)\ |(di/"i)| • \{d 2 ,n 2 )\ = \{d\d 2 ,n x n 2 )\ 

Using the convention that | _L) =0, it is straightforward to verify the following equations. 





It follows from equation 8.11 that the finitary points in CP distribute over addition. That is, for 

/ceC: 
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Distributivity fails for oo, as is usually the case when formally introducing points at infinity. 





The relationship between GHZ states, W states and the arithmetic of fractions is explored in 
detail by Coecke, Kissinger, Merry, Roy in USUI . 
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Part III 



Automation 
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Chapter 9 

Automating String Graph Rewriting: 
Quantomatic 



The Quantomatic Project |40| provides a set of tools for working with string graphs and string 
graph rewrite systems. It is divided into three parts: QuantoCore, QuantoGUI, and QuantoCoSy. 
The first is called QuantoCore, which is an ML library that does most of the work in representing, 
manipulating, and rewriting string graphs. QuantoCore uses three basic kinds of files. 

1. *.graph files store string graphs. 

2. *.theory files store graphical theories. A graphical theory contains information about what 
kinds of vertices can occur in a string graph, what kinds of data can occur on vertices, and 
how that data should be matched. 

3. *.rules files contain sets of string graph rewrite rules. 

On top of QuantoCore, we have developed a graphical user interface called QuantoGUI. Cur- 
rently, QuantoGUI can: 



create and edit string graphs (Figure |9T) and string graph rewrite systems (Figure 9.2 1, 

search for rewrites in a selected subgraph and apply them manually (Figure [9~3| , 

display animated normalisations of string graphs with respect to a rewrite system, 

do "fast-normalisation" of string graphs and only display the output, and 

interact with computer algebra systems to perform concrete calculations of string graphs as 



linear maps (Figure 9.4 see [39] for details). 



While it is currently quite minimal, we intend to make QuantoGUI into the graphical analogue 
of a proof assistant. Just as Isabelle [56 1 or Coq IfTUl exposes a variety of techniques for constructing 
formal proofs with respect to a term rewrite system (i.e. an algebraic theory), Quantomatic aims 
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Figure 9.1: A string graph in Quantomatic 



to do the same for string graph rewrite systems. The real power of systems like Isabelle and Coq 
comes not just from rewriting, but from the application of inductive reasoning techniques to prove 
theorems in first- or higher-order logic. These techniques do not always extend straightforwardly 
from terms to graphs. However graphs with quantifiers (defined by Rensink et al in |60|), pattern 
graphs, and graphs defined by grammars can give us the ability to reason about an infinite set 
of graphical equations simultaneously. We discuss how some of these more advanced techniques 



might work in section 10.1.5 



For more details about the Quantomatic project and to download the software itself, visit the 



project's web page at http://sites.google.com/site/quantomatic 



9.1 Conjecture Synthesis and QuantoCoSy 

In this section, we will discuss QuantoCoSy, the third component to the Quantomatic project. It 
performs automated theory creation using a technique called conjecture synthesis. 

One of the main goals of automated reasoning is to reproduce as much as possible on a machine 
the way a human mathematician thinks and a works. Consider a situation where a mathematician 
has the following: 

1. A set of generators for a new algebraic object X. 

2. A concrete model or set of models for X. 
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Figure 9.2: Editing a rule 
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Figure 9.3: Applying a rewrite 
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In[54if]:= t#rm = Hi lb [ ( | 
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ln[S54]:= guantoKil 1 [ ] 3 

Figure 9.4: Invoking Quantomatic from a Mathematica |65 1 notebook 

Though X is not defined yet, the models that the mathematician has in hand are things that 
"morally" should be X's. For instance, if the mathematician were trying to develop an algebra for 
studying maximally entangled states, as in chapter [§1 these would be a set of known maximally 
entangled states. From this data, the mathematician now seeks to axiomatise X. 

One way he or she could start this process is to "plug-and-chug". That is, the mathematician 
could plug these generators together randomly and see which compositions equal other composi- 
tions. In actuality, this process is not totally random, as the mathematician calls upon experience 
and a handful of helpful heuristics for seeking out likely equations. 

Heuristic 1: seek familiarity, the mathematician may discover that these generators are actually 
satisfying some properties of a known algebraic object, such as a Hopf algebra. From this, the 
mathematician deduces that the generators are more likely to satisfy the rest of the identities of a 
Hopf algebra than they are to satisfy some other, randomly chosen identities. 

Heuristic 2: avoid redundancy. Once the mathematician has a handful of identities, then while 
searching for new identities, he or she will avoid those which are trivially derivable from those 
already known. For instance, if the mathematician already knows a ■ (b + c) = (a ■ b) + (a ■ c), it is 
redundant to consider the terms a ■ (b + c + d) and (a -b) + (a ■ c) + (a ■ d). 

Heuristic 3: elegant identities are essential. Though the generators in question may exhibit 
large, complex, and asymmetric identities, the mathematician is treating the generators as merely 
one example of a more abstract mathematical object. His experience tells him that simpler identities 
tend to be the most crucial in characterising abstract mathematical objects. 

Though there is theoretically no end to this procedure, the mathematician may hold a convic- 
tion, as per heuristic 3, that he or she will eventually find no more "interesting" identities above a 
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certain size. At this point, the procedure is effectively complete. 

This section is about reproducing this process of (graphical) theory generation with a program 
called QuantoCoSy 

9.1.1 Conjecture Synthesis for Terms 

Conjecture synthesis is a technique that automatically generates "reasonable" conjectures to test for 
an algebraic theory. This procedure for term-based theories was introduced by Johansson, Dixon, 
and Bundy in 2010 [33] . The tool that implements their technique is called IsaCoSy (for Isabelle 
COnjecture SYnthesis). A single round of their algorithm proceeds as follows: 

1. Initialise a conjecture as an expression with holes, e.g. "(*) = (*)"• Holes mark places where 
there is more expansion to do. They also come with certain constraints on what terms can 
be instantiated in them. Initially these are maximum size or depth constraints to guarantee 
termination, but they will get updated later. 

2. Substitute holes with all possible terms-with-holes. This is done by a depth-first enumeration 
of possible substitutions, respecting the constraints on holes. 

3. Once there are no holes, save the expression as a possible conjecture. 

4. Perform (fast) post-filtering of conjectures that are obviously not true. IsaCoSy does this by 
using Isabelle's fast counter-example search. 

5. Try to prove the remaining conjectures using an automated proof search routine. When a 
proof is found, save the conjecture as a new rewrite rule. 

When viewed as a single round, this looks like the naive technique that one would use to go 
about searching for conjectures. However, the interesting part is how constraints are updated be- 
tween rounds. A given round of the synthesis procedure will produce a set of true equations, E. We 
can turn these equations into rewrite rules by putting a reduction ordering on terms (see Definition 



4X7] in section |4T). 

S = {*!-> h : {(t lr t 2 ) G EV {t 2 ,h) e E)Aw(fi) > tt(t 2 )} 

Let R be the set of all terms occurring on the LHS of rules in S. These are called reducible 
expressions, or redexes. For a rewrite rule r = (t\ — > ti\ any equation containing t\ can already 
be proved using r and an equation containing t%. As conjectures involving redexes are redundant, 
they are never considered. We can cut them out of the search space by updating the constraints on 
holes, such that the search performed in step (2.) above never generates terms containing redexes. 

While this is a fairly simple procedure, in practice, this can exponentially reduce the search 
space. 
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Figure 9.5: QuantoCoSy runs in Firefox using PolyChome [53 |, an extension for running Poly/ML 
in a web browser. 

9.1.2 Adapting Conjecture Synthesis to String Graphs 

The synthesis procedure for string graph identities is similar to that for terms. The procedure 
described in this section is implemented on top of QuantoCore with a program called QuantoCoSy 
(Figure |9~5) . 

In the term case, a single round of synthesis is parametrised by two natural numbers: the max- 
imum term size (or term depth) and the maximum number of free variables occurring in the term. 
For string graphs, we parametrise a run with four natural numbers: the number of inputs M, the 
number of outputs N, the maximum number of box-vertices B, and the maximum number of plug- 
gings P. We enumerate string graphs by starting with disconnected string graphs, i.e. graphs only 
containing box-vertices and their adjacent edges and wire-vertices. For instance: 



k AYm 



The synthesis procedure for string graphs takes as input: 

1. A string -graph signature T. For simplicity, we will focus on string graph signatures with only 
a single wire type. 
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2. An (m, n)-tensor for every box in T with m inputs and n outputs. 

3. A function <x) from string graphs to a well-ordered poset (P, <). This will serve as a (candi- 
date) reduction ordering. 

Throughout the synthesis, we maintain a rewrite system S, and a set R of reducible string 
graphs. A single run given by natural numbers (M, N, B, P) consists of the following steps: 

1. For all p such that < p < P, generate all disconnected string graphs with M + p inputs and 
N + p outputs, up to isomorphism. 

2. For a disconnected string graph with M + p inputs and N + p outputs, there are (M + p) ■ 
(N + p) input/output pairs. Choose p of them to plug together. These are chosen so that the 
enumeration is exhaustive and minimises the occurrences of isomorphic string graphs (any 
remaining isomorphic graphs will be filtered out later). After each plugging, if a string graph 
contains an element of R as a subgraph, terminate that branch of the enumeration. 

3. Evaluate the string graphs as tensors, performing a tensor contraction for every edge (c.f. the 



construction of F in Theorem 5.5.10 i. Organise them into equivalence classes, up to scalar 
factors and permutations of inputs and outputs (which are stored with the string graphs). 
Filter out any remaining isomorphic graphs. 

4. For each equivalence class C, identify a set C' C C of minimal elements with respect to to. 
Add any string graph in C — C' to the set of reducible graphs R. Choose a string graph s E C 
at random and add rules t — > s to the rewrite system S for alH 6 C — C' . Add rules in both 
directions (s — > s', s' — > s) for the other minimal graphs s' 6 C' — {s}. 

We postpone filtering out isomorphic graphs until step (3.) because tensor contraction is fast, 
and two graphs will not be isomorphic unless they are in the same equivalence class. We choose 
to such that step (4.) picks out as few graphs in C' C C as possible. If C' is a singleton, all of the 
rewrites respect the reduction order. Rewrites that do not strictly decrease co(G) (i.e. rewrites from 
an element of C' to another element of C') are called congruences. We can retain a terminating rewrite 
system if we throw out all of the congruences, but not without losing some information about the 
model. Therefore, a large portion of the refinement of this technique has to do with eliminating 
congruences or handling them in smarter ways (i.e. building them in to the graph representation). 



We applied QuantoCoSy to generators of the GW-pair given in section 8.4 We preloaded the 
synthesis procedure with rewrites that merge two vertices of the same colour (i.e. the spider laws), 
and synthesised graphical identities for B = 3, P = 3, and M + N < 3. This yielded 223 rewrite 



rules, most of which were versions of the four axioms given in Theorem 8.4.5 
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Figure 9.6: Rewrite rules synthesised from the generators of the GW-pair defined in section 8.4 



As was the case for terms, filtering out redexes has a huge impact on the number of string 
graphs that need to be checked. The naive synthesis procedure with the same parameters yielded 



over 20,000 rewrite rules. In Figure 9.1.2 we plot the number of rewrite rules generated using a 



naive graph enumeration algorithm against the number generated using the procedure above. 

While the redex-elimination procedure yielded a much more manageable number of rewrite 
rules, a quick look at the rules by a human will show that many are still trivially consequences of 
each other. Therefore, there is still much work to be done in eliminating redundancy in rules and 
building more of the symmetries of a rewrite theory into the graphical representation itself. We 



discuss some of the ways in which we hope to accomplish this in sections 10.1.4 and 10.1.5 
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Chapter 10 

Conclusion 



The main contributions of this dissertation fall under three categories: (1) the definition and proper- 
ties of string graphs, (2) the application of string diagrams /string graphs to quantum computation, 
and (3) the automated generation and manipulation of string graph rewrite systems. 

First, we defined string graphs and string graph rewrite systems using double-pushout graph 
rewriting. We also introduced the notion of composition for string graphs via certain pushouts 
called pluggings and defined a category whose morphisms are string graphs modulo a rewrite sys- 
tem, where categorical composition is defined using plugging. Using these rewrite categories, we 
constructed the free traced symmetric and the free compact closed categories over a monoidal sig- 
nature. These results allow us to prove identities in an arbitrary symmetric traced category using 
graph rewriting. 

In EH , Dixon and Kissinger proved that rewrite categories are equivalent to their topological 
analogues, as defined by Joyal and Street in l35l . Since free categories are defined up to equivalence, 
the results in this dissertation imply two of the missing "GTC-II" theorems. Namely, a category 
defined using string diagrams forms the free symmetric traced (or compact closed) category over a 
monoidal signature. 

Next, we illustrated the application of diagrammatic languages to the study of quantum phe- 
nomena: namely complementary observables and multipartite entanglement. After reviewing Co- 
ecke and Duncan's characterisation of complementarity using interacting Frobenius algebras, we 
proved several new results, including a complete classification of pairs of strongly complementary 
observables in an finite dimensional Hilbert space. This classification theorem showed a 1-to-l cor- 
respondence between D-dimensional strongly complementary pairs of observables and Abelian 
groups of order D. 

After this, we showed how certain kinds of highly entangled states, called Frobenius states, in- 
duce Frobenius algebras. Furthermore, the two canonical tripartite qubit states — GHZ and W — are 
distinguished by a simple condition on these Frobenius algebras: specialness or anti-specialness. 
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A Frobenius state in C 2 (g> C 2 ® C 2 is SLOCC-equivalent to GHZ if and only if its induced com- 
mutative Frobenius algebra is special, and a Frobenius state is SLOCC-equivalent to W if and only 
if its induced CFA is anti-special. 

Drawing on the interaction properties of these canonical qubit states, we introduced the theory 
of GW-pairs. These abstract the interaction properties of the Frobenius algebras associated with 
GHZ and W, and can be used to study arbitrary multipartite entangled states. Focusing on the 
specific case of GHZ and W, we showed that the generators of the GW-pair are universal for quan- 
tum computing and characterised the behaviour of the GHZ- and W-algebras in terms of (partial) 
arithmetic operations defined on the complex projective line CP . 



Finally, in part III we introduced the Quantomatic project, which consists of tools for the au- 
tomatic construction and manipulation of string graph rewrite theories. We illustrated how the 
process of conjecture synthesis introduced by Johansson, Dixon, and Bundy Il33l can be adapted 
to the setting of string graphs, where each graphical generator is given a concrete valuation as a 
linear operator. Using this technique, it becomes practical to enumerate all of the graphical identi- 
ties exhibited by a set of generators under composition for small- to medium-sized string graphs. 
These tools, along with the methods they employ, show great potential for changing the way we 
formulate and interact with a wide variety of theories involving interacting components. 

10.1 Future Work 

10.1.1 Classifying Frobenius states 



In section 8.3.2 we gave a classification result for anti-special commutative Frobenius algebras 
of dimension 2 and for special Frobenius algebras of any dimension. The natural next step is to 
ask if a reasonable classification result can be constructed for ACFAs of dimension D > 3. This 
classification is likely to be more difficult in the case of SCFAs, where classification follows from the 
fact that there are not very many semisimple, commutative X-algebras. When K is an algebraically 
closed field, the only semisimple algebras are direct sums of K itself. In the case of ACFAs of 
dimension D > 1, the vector TJ always generates a non-trivial nilpotent ideal, so non-trivial ACFAs 
are never semisimple. However, it is our hope that anti-specialness will prove a strong enough 
condition on a Frobenius algebra to yield a straightforward classification. 

We also intend to expand the classification results for Frobenius states on a different front: the 
classification of arbitrary Frobenius states for low dimensions (e.g. D < 5). This problem is tractable 
because there are relatively few commutative, unital algebras of dimension 5 or less. There is one 
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such algebra for dimension 1, two for D = 2, four for D = 3, nine for D = 4, and 20 for D = 5. 
For dimensions 3 and above, only some of these algebras extend to Frobenius algebras, and it 
becomes a straightforward task to enumerate those that are. The classification of Frobenius states 
for dimension 3 was completed this year by Honda [32 j. As in the case for two dimensions, a 
Frobenius state up to SLOCC is uniquely determined by the rank of its loop map. Thus there is 
only Frobenius state corresponding to an SCFA (full rank), one corresponding to an ACFA (rank 
1), and one corresponding to what Honda calls an intermediate special commutative Frobenius algebra 
(ICFA), which has a loop map of rank 2. The author, along with Coecke and Merry intend to 
incorporate this result into a complete classification of Frobenius states up to dimension 5 in the 
sequel to 11191 

10.1.2 Super-qubits and the W state bialgebra 

The W state exhibits some interesting properties that we have not yet fully explored. In section 
we introduced the operation ( — ) O t of transposition relative to a particular Frobenius algebra. 



3.2 



In the case of the GW-pair corresponding to the GHZ and W states, let Tj; = I Jk 1 and let 
9 = ( ^ ) ■ The following then forms a bialgebra: 

W:=(C®C,y, $,X'*) 

But there is a catch: it does not form a bialgebra in FHilb, but rather the category SuperHilb 
of super-Hilbert spaces and even mapsV] This is the category where Hilbert spaces are graded into a 
"bosonic" and a "fermionic" part. Objects are Z2-graded Hilbert spaces Ho ffi H\ and morphisms 
are linear maps / : Ho © H\ — > H' Q © Hj that respect the grading. This category is monoidally 
equivalent to the category of unitary representations of Z2, but we use a different symmetry map 
that introduces a —1 factor when a "fermionic" element crosses another one: 

Ho©Hi,h ©«i U I w "I > \\fi®\i) otherwise 

For instance, over the graded space C © C of super-qubits, the swap map is defined by the fol- 
lowing matrix: 

/l \ 



10 

10 

\0 -1/ 



Why is it interesting that W forms a bialgebra in SuperHilb, especially if Hilb is our primary 
category of interest? There is a faithful forgetful functor U : SuperHilb — > Hilb that is strongly 
monoidal, but does not preserve symmetries. As a result, any planar diagrammatic identity we can 
prove in SuperHilb also holds in Hilb. For instance, we can prove the following equation for any 
commutative bialgebra. 



1 Thanks to Jamie Vicary for pointing this out. 
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This is a bit of a contrived example, but its a special case of a general result for commutative 
bialgebras. A commutative bialgebra diagram is uniquely determined by the number of forward- 
directed paths from each input to each output. In the above equation, there are exactly 4 distinct 
paths connecting the input to the output on both the LHS and the RHS. When morphisms exhibit 
certain properties in one "categorical context" but not in another, we express this graphically, using 
the functorial boxes defined by Mellies in l50l . Starting with a diagram in Hilb, we can draw a box 
around a sub-diagram as long as the diagram (1) is planar and (2) contains only morphisms in the 
image of U. We can then rewrite the elements inside the box as if they are in Super Hilb, possibly 
breaking planarity along the way. Then, as long as the diagram is ultimately planar, we can erase 
the box. 



Hilb 







Hilb 



10.1.3 GW-pairs and strongly complementary observables 

Aside from contrived examples we have given, one might wonder if there are useful planar equa- 
tions satisfied by bialgebras. In 2008, Mellies showed that any commutative bialgebra whose 
comonoid is the transpose of its monoid, the following maps form a Frobenius algebra |49|: 



i 



Recall that the monoid, comonoid, and Frobenius identites are all planar, so the above yield a 
Frobenius algebra in both SuperHilb and Hilb. Furthermore, in the case of the W state algebra, 





this Frobenius algebra is precisely the t-SCFA for the X observable, defined in section 7.3 Expand- 
ing Jtt and (!j using the dualiser, we obtain the following expression for the Frobenius algebra 
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(c 2 ,y,?,^,6; 







(10.1) 



? 



I 



Under this encoding, we can relate the two constructions of the CNOT gate from sections 7.3.1 
and l8X2l 



< 



en— •; ' -- - — >** - = o o 



So, we know that in the particular case of the GHZ/W Frobenius algebra pair, we can construct 
the Frobenius algebras for the strongly complementary observables Z and X. However the general 
relationship between GW-pairs and strongly complementary pairs is still unknown. However, we 
conjecture that the axioms of a distributive GW-pair subsume those of a strongly complementary 
pair. 

Conjecture 10.1.1. Let ( JS, 4) be a distributive GW-pair. Then, for a third Frobenius algebra 

( XJ , 9/ j~X r 6 ) defined as in 1 10.1 1 above, ( Jji , Jji ) forms a strongly complementary pair of 
t-SCFAs. 



Another area for future work is the conceptual understanding of strong complementarity. In 



section 7.2 we provided a complete classification for pairs of strongly complementary observables. 
While we have a clear idea of what strong complementary means mathematically, a physical in- 
terpretation of strong complementary is still missing. The fact that a particular pair of strongly 
complementary observables (Pauli Z and X) play such a central role in the study of complementar- 
ity in finite dimensions suggests that such an interpretation exists. As a first step toward finding 
this interpretation, we are looking for quantum protocols and theorems that rely crucially on certain 



forms of the bialgebra equations given in Definition 7.2.3 



10.1.4 Knuth-Bendix completion for string graphs 

Knuth-Bendix completion is a procedure for turning terminating, non-confluent rewrite systems 
into terminating, confluent rewrite systems. It works by identifying critical pairs for a rewrite sys- 
tem. For a finite set of term rewrite rules, we can always identify a finite set of terms s E S that 
represent all of the "possible ways" in which the left-hand sides of two rewrite rules can overlap. 
A critical pair is then a pair of distinct, normalised terms t\, i-i that were both rewritten from some 
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such term s. A rewrite system is confluent precisely when it has no critical pairs. Knuth-Bendix 
completion takes a rewrite system R and a strict, total reduction ordering w on terms, and operates 
as follows: 

1. Compute all of the critical pairs for R. 

2. For each critical pair (t\, t?), add t\ — > ti to R if w{t\) > wfa)- Add ii — > t\ otherwise. 

3. Repeat until there are no critical pairs. 

There are two possible outcomes for this procedure: (1) it halts and produces a confluent, termi- 
nating rewrite system, or (2) it keeps producing more an more rewrite rules forever. Since arbitrary 
word problems can be encoded as terminating term rewrite systems, there must exist some rewrite 
systems for which the Knuth-Bendix procedure does not halt. However, for many useful classes of 
rewrite systems, this procedure always halts, yielding a terminating, confluent rewrite system. 

As an example, let / < XfXj, . . . , X n ] be some ideal of a polynomial ring. It is a well-known 
fact that the ideal membership problem is decidable precisely when one can find a special set of 
polynomials generating / called a Grobner basis. There is a natural way to consider a particular 
polynomial /, as a rewrite rule on other polynomials. 



X\X 2 + 4X 2 + 2 — >• (x?X 2 — > -4X 2 



This rule then rewrites certain polynomials P that are "matched" by /; to P — /,. Grobner bases 
are then exactly those sets of polynomials generating / that, considered as rewrite systems, are 
terminating and confluent. A crucial tool for computer algebra systems is Buchberger's algorithm, 
which derives Grobner bases from arbitrary finite sets of polynomials. This algorithm is precisely 
Knuth-Bendix completion applied to polynomial rewrite systems [6J . 

Confluence for diagrammatic rewrite systems can be more subtle than term rewrite systems. 
For example, it was shown by Lafont that for general diagrammatic rewriting, finite diagrammatic 
rewrite systems could lead to infinite families of critical pairs exhibiting what he calls global con- 
flicts [45 J. However, this problem can be overcome by performing critical pair analysis on diagrams 
with "gaps", as Mimram did in his thesis |[5T| to demonstrate a locally confluent presentation of 
Mat(N). 

The source of the subtlety here lies in the fact that the diagrams considered by Lafont et al exist 
in arbitrary monoidal categories. Any additional categorical structure (like symmetries and duals) 
are treated "opaquely" as morphisms in a 2D grid. String graph rewrite systems rely crucially on 
the fact that the traced symmetric structure of the category is built in to the graphical representation 
of the string graph. Therefore much of the subtlety of the Lafont-style approach falls away (or more 
precisely, is absorbed in graph isomorphism) at the expense of generality. 
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In 2008, Kissinger defined a Knuth-Bendix procedure for diagrams of interacting commutative 
Frobenius algebras and applied it by hand to derive a confluent fragment of the Z/X-calculus [38|. 
Extending this to general string graph theories is straightforward, and its implementation in Quan- 
tomatic will be a useful tool both on its own and as a component of more sophisticated procedures. 

10.1.5 Pattern graphs and graphical inductive reasoning 

We often wish to work with graphical generators that have commutative inputs and outputs with 
variable arities. We can encode this into the usual string graph formalism by adding a box type for 
every possible arity and adding rewrite rules for commutativity However, experience has showed 
us that there is much to be gained by encoding as much symmetry into the representation of an 
algebraic system as possible. For instance, in term rewrite theories, once a function is assumed to 
be commutative, its arguments are treated as a multiset, rather than an ordered list. We incorporate 
this into the theory of string graphs by introducing string graphs with arities. 

Monoidal signatures are replaced by signatures with arities T = (O, M,dom,cod). For PN the 
powerset of N, the functions: 

dom : M ->• w(0 x PN) cod : M ->■ w(0 x PN) 

assign a morphism to a list of pairs (o 6 O, A C N). The element o is the type of the input 
or output as before, and the set A is the set of allowed arities. The requirement that the typing 
maps Tq : G — » Gj for string graphs must be local isomorphisms around box vertices is replaced 
with the requirement that these maps respect arities. For an edge e in the typegraph, we require 
that e occurs with an allowed arity around every box-vertex v 6 B(G). Formally, for all box- 
vertices v 6 B ( V) where t g (v) = f and all edge types iiy /Z -, out^y such that dom(/) [i] = (o,A) and 
cod (/)[/] = (o',A') the following equations must hold: 

IKrHin^EA 

(10.2) 
\(T£)-Hout f/j )\eA> 

Using arities admits a great deal of flexibility in string graph signatures. The usual notion of 
string graphs without arities is recovered by letting all of the sets of allowed arities by {1 }, because 
Tq is a local isomorphism iff the inverse images defined in 1 10. 2} are always of cardinality 1 . The 



other extreme is where all of the sets A = IN, where any arity is allowed. This is incredibly useful 



when working with spiders, as defined in section 3.2.1 In fact, this is default mode for Quan- 
tomatic, as it was originally designed to work with "spider-based" graphical languages like those 
defined in part IT] In between these two extremes, one could define optional (non-commutative) in- 
puts and outputs by setting A = {0, 1} and (fixed arity) commutative inputs and outputs by setting 
A = {k}. 
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One of the most useful things about having generators with variable arities (as opposed to just 
introducing new generators for every arity) is that we can define pattern graphs. These are graphs 
where certain portions of a graph (and their incident edges) can be duplicated any number of 
times. We define pattern graphs by introducing [-boxes (pronounced "bang boxes") around string 
subgraphs. Intuitively pattern graphs represent a set of concrete string graphs where each of the 
!-boxes can occur or more times. 

Example 10.1.2. A pattern graph with two !-boxes, and the set of concrete graphs it represents. 



a: 

b; 


$ 




= < 




i 




V 



I 



o 



o 



' I ' 



• • • 



The real power of pattern graphs comes from the ability to define pattern graph rewrite rules. 
These consist of a pair of pattern graphs, and a suitable bijection between the !-boxes on the LHS 
and RHS such that we can represent an infinite set of valid string graph rewrite rules. For instance, 
a rewrite rule to merge two vertices of any arity can be expressed as: 




(10.3) 



A pattern graph rule can be instantiated into a concrete rule by replacing a single !-box with N 
copies of that !-box on the LHS and replacing its corresponding !-box with N copies on the RHS. 
For example, the following is an instance of the rule above: 




Although a preliminary implementation of ! -boxes already exists in Quantomatic, to assure its 
validity, we need to formalise string graphs with arities and pattern graph matching and rewriting 
within the framework of partial adhesive categories. In many ways this is a straight-forward task, 
but care must be taken when defining the correct notions of matching and rewriting. 

We can also do rewriting on pattern graphs themselves. That is, we can use infinite sets of rules 
(i.e. pattern graph rewrite rules) to reason about infinite sets of graphs (i.e. pattern graphs). 

One of the most interesting applications for pattern graph rewriting is combining the conjecture 
synthesis procedure with Knuth-Bendix completion to automatically generate new pattern graph 
rewrite rules. IsaScheme, a tool for "scheme-based" conjecture synthesis [52|, has already had some 
success in this area for term-based theories. 
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As it stands, the synthesis procedure can only discover new concrete rewrite rules, however, 
using Knuth-Bendix, it could automatically combine its pre-existing knowledge (in the form of an 
initial set of pattern graph rewrite rules) with its findings. 

Consider a case where we initiate the synthesis procedure with the rewrite rule given by 1 10.3) . 
At some point, it discovers a new identity: 



T 



TT 



(10.4) 



If we perform Knuth-Bendix on a rewrite system, we obtain a critical pair. 





T 



Then, under a suitable ordering on string graphs, we can consider the string graph on the right 
to be more reduced than the string graph on the left, so we introduce a new pattern-graph rewrite 
rule. 




T 



This rule is stronger than 1 10.4 1, and can be thought of as the "spiderised" version of that rule. 
Since this rule is stronger than the previous one, there are more reducible expressions in the rewrite 
system being synthesised, and hence a smaller search space for string graph enumeration. This sug- 
gests that incorporating a Knuth-Bendix step into the conjecture synthesis procedure could vastly 
improve its performance as well as generate fewer, more powerful graphical identities. 

In addition to pure equational reasoning (i.e. rewriting proofs) with ! -boxes, we can do some 
inductive reasoning as well. Suppose we extend the language of !-boxes, allowing them to be 
bound to expressions over natural numbers, possibly containing free variables. 

U\ «2 n l n 2 



X> 




n 3 n 4 n 3 n 4 

We interpret this rule as the set of all concrete rules where the z'-th !-box is duplicated n, times. 
For non-atomic expressions (i.e. expressions that are not just a single free variable or constant), it 
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could become tricky to prove that any substitution of free variables yields valid concrete rewrite 
rule without ambiguity. However, if this can be done even in limited cases, such a language allows 
one to define a notion of induction, in the form of an inference rule on graphical identities. 
m n m — 1 n + 1 



ind 



This rule says, "If one can convert a single copy of a !-box to a single copy of another !-box, then 
one can convert k copies of that !-box for any k." Note that the base case is trivially satisfied: if we 
kill both ! -boxes, we are left with G = G. 

Example 10.1.3. Using the ind inference rule to prove a new pattern graph identity. Take the 
following rules as given: 





G 



TT 



9— 



A 



Then, we will use the induction principal to prove: 



• 


T 

• 



First, we derive the hypothesis for ind using rewriting. 



T 

i 



T 



T 



m n 



m — 1 n 




T 




T 



T 



m — 1 n 

The general rule is then constructed with an application of ind. 



m — 1 n 



m — 1 n + 1 
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T 

• 


• 


T 

• 



m n 



m 



1 n 



ind 



T 



Note that this identity could not be produced using purely-equational means (e.g. using Knuth- 
Bendix completion), as none of the equations that we assumed contain the vertex • inside of a 
!-box. 

The proof above is quite similar in form to the types of inductive proofs checked by automated 
proof assistants. In principal, it would be straightforward to verify this proof in Quantomatic. 
Perhaps more advanced techniques like those employed by the proof-planning tool IsaPlanner l23l 
(e.g. rippling) could be adapted to string graphs to automatically search for such proofs as well. 

Of course, !-boxes are not the only way to compactly represent infinite sets of string graphs. 
We could also describe sets of graphs by introducing a "meta" rewrite system, where certain types 
of rewrite rules are treated as productions in a graph grammar. In fact, this was the terminology 
originally used by the graph rewriting literature in the 1970s [27 1. These rules would not need to 
respect inputs and outputs to a string graph, but some provision would need to be made when 
applying a meta-rule to a normal string graph rewrite rule to ensure that both the LHS and RHS 
are expanded in the same way. One could think of pattern graphs, as defined in this section, as 
something akin to regular languages, whereas sets of graphs described by graph grammars are 
richer (e.g. context-free or context-sensitive) languages. It is the hope that increasingly sophisti- 
cated graphical languages will lead to increasingly elegant and powerful graphical theories with 
applications in physics, linguistics, logic, and beyond. 
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